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FOREWORD 


This  report  was  prepared  under  Contract  No.  N00014-85-C-0814 
with  the  Office  of  Naval  Research  (ONR),  under  Phase  I  of 
the  Small  Business  Innovation  Research  (SBIR)  Program. 
Vista  Research  Corporation  expresses  its  appreciation  to 
ONR,  to  the  SBIR  Program,  and  to  Dr.  Neal  D.  Classman,  ONR 
Scientific  Officer,  for  their  support  in  sponsoring  this 
work.  The  author  of  this  report  was  Dr.  J._6eorge  Caldwell. 
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I.  INTRODUCTION  AND  SUMMARY 


A.  Study  Purpose 


''*^This  report  describes  the  results  of  a  study  to  examine  the 
feasibility  of  developing  fast  algorithms  for  estimation, 
prediction  and  control.  The  objective  of  the  study  was  to 
assess  the  likelihood  of  finding^  procedures  which  could  be 
faster,  in  terms  of  computer  running  time,  than  the 
classical  least-squares  method  of  parameter  estimation,  used 
e X tensi vely  to  develop  models  of  stocnastic  phenomena. 
While  the  least-squares  method  has  proved  its  worth  in  over 
a  century  of  use,  it  has  sane  serious  drawbacks,  stemming 
from  the  fact  that  it  requires  the  inversion  of  matrices. 
For  "large"  problems  such  as  the  problem  of  tracking  many 
missiles  or  processing  data  from  multiple  intelligence 
sensors  Tn  real-time ,  tne  computational  buraen  bit  thS 
least -squares  method  can  overwhelm  even  today's  powerful 
computing  systems.  Previous  attempts  to  solve  this  problem 
have  centered  on  the  development  of  faster  computers  (e.g., 
array  processors),  the  improvement  of  algorithms  for  matrix 
inversion ,  oTf  ETTe  simpiiticarion  of  rhs  iiiDdel  To  produce  a 
matrix  that  is  easier  to  invert.  In  general,  these 
approaches  have  not  been  successful  in  solving  the  problem. 
Modern  sensor  exploitation  systems,  for  example,  still 
cannot  operate  in  real-time  or  even  near -real -time.  ^  ^ 

The  present  study  proposed  to  adopt  a  totally  different 
approach  to  the  problem.  In  particular,  it  was  proposed  to 
investigate  methods  which  would  avoid  the 
compute ti ona 1 ly -i ntensi ve  process  of  matrix  inversion. 
Avoiding  this  procedure  could  reap  tremendous  benefits.  For 
example,  the  problem  of  tracking  a  missile  can  involve  the 
inversion  of  a  nine-dimensional  matrix  at  each  instant  that 
a  radar  pulse  is  received,  if  a  nine-component  state  vector 
is  used  to  represent  missile  position,  velocity,  and 
acceleration. 


This  project  was  supported  as  a  Phase  I  project  of  the  Small 
Business  Innovation  Research  (SBIR)  program.  The  objective 
of  a  Phase  I  SBIR  project  is  to  assess  the  feasibility  of  a 
proposed  concept  and  to  develop  a  plan  for  developing  the 
concept.  If,  based  on  the  Phase  I  effort,  the  proposed 


concept  appears  to  have  merits  then  development  of  the 
concept  may  be  funded  under  Phase  II  of  the  SBIR  progreun. 

B.  Study  Results 

This  Phase  I  project  has  successfully  accomplished  all  of 
the  tasks  identified  in  the  proposalr  and  established  the 
feasibility  of  the  proposed  concept.  Seven  tasks  were 
proposed  to  be  accomplished  in  this  study: 

Task  1.  Development  of  Criteria  for  Comparing 
Alternative  Estimation  Schemes 

Task  2.  Development  of  Test  Cases 

Task  3.  Implementation  of  the  Single-Variable  Linear- 
Model  Case 

Task  4.  Extension  to  the  Multiple-Variable  Linear- 
Model  Case 


Task  5.  Comparison  of  Methods 

Task  6.  Design  of  Phase  II  Work  Plan 

Task  7.  Preparation  of  Final  Report 

The  results  of  each  of  the  seven  study  tasks  are  summarized 
in  the  paragraphs  that  follow. 

Task  1.  Development  of  Criteria  for  Comparing  Alternative 
Estimation  Schemes 

A  total  of  seven  criteria  were  identified  and  retained  as 
useful  for  describing  the  performance  of  alternative 
estimation  algorithms.  These  criteria  address  the  following 
performance  aspects: 

o  computational  speed 

o  computer  storage  requirements 

o  precision  of  the  model  parameter  estimates 

o  bias  of  the  model  parameter  estimates 

o  accuracy  of  the  model  parameter  estimates 

o  precision  of  model-based  predictions 


o  numerical  stability  of  the  algorithm 

"Computational  speed"  refers  to  the  time  required  to  analyze 
the  data  and  produce  estimates  of  the  model  being  used  to 
describe  the  data.  "Computer  storage  requirements"  refers 
to  the  total  amount  of  direct-access  memory  ("core") 
required  to  implement  the  algorithm.  "Estimate  precision" 
refers  to  the  amount  of  variability  of  the  estimates  in 
repeated  data  samples.  "Estimate  bias"  refers  to  the 
difference  between  the  expected  (average)  value  of  the 
parameter  estimates  in  repeated  data  samples  and  the  true 
values  of  the  pareuneters.  "Estimate  accuracy"  is  a  combined 
measure  of  precision  and  bias.  "Prediction  precision" 
refers  to  the  closeness  of  the  model-based  predictions  to 
actual  future  values.  "Numerical  stability"  refers  to  the 
ability  of  an  algorithm  to  converge  to  a  desired  answer. 

Specific  measures  were  determined  for  each  of  the  preceding 
concepts.  Because  of  project  resource  limitations,  however, 
it  was  not  possible  to  develop  computer  software  to 
determine  numerical  values  for  all  of  the  measures. 


Task  2.  Development  of  Test  Cases 


It  was  decided  to  test  the  performance  of  alternative 
algorithms  on  sixteen  data  sets.  All  of  these  data  sets 
involve  a  single  dependent  variable  ("y")  and  a  number  (m) 
of  independent  variables  ("x's").  In  each  case  the  model 
used  to  generate  the  data  is  of  the  form: 


Yj  =  bg  +  Zbi  Xij  +  ej 


*  ‘’o  +  ij  'b  +  ej 


This  model,  called  a  linear  statistical  model,  specifies  the 
relationship  of  the  dependent  variable  y  to  m  independent, 
or  explanatory,  variables  <  j  ,  X2  j  ,  .  .  .  ,  Xn,  j  )  .  The 

variables  b  and  bi,bo,...bn,  are  constants,  called 
regression  coexficients.  They  are  specified  when  generating 
the  test  data,  and  are  to  be  estimated  by  the  estimation 

algorithm.  The  e.  are  called  model  error  terms,  or  model 
residuals.  They  are  a  sequence  of  uncorrelated  random 
variables  with  mean  zero  and  standard  deviation  SIG.  The  m 
x's  are  also  random  variables  with  zero  mean  and  variance 
matrix  e*sigma2  (where  the  asterisk  denotes 
multiplication).  The  test  cases  vary  in  the  ratio  of  SIG  to 
SIGMA  (models  for  which  the  ratio  SIG/SIGMA  is  low  are 


easier  to  estimate)#  and  the  strength  of  the  correlations 
among  the  x's  (if  the  x's  are  uncorrelated#  i.e.#  z  is  the 
identity  matrix#  the  estimation  is  easy;  if  the  x's  are 
highly  correlated#  the  estimation  is  difficult).  The  values 

of  SIG#  SIGMA#  I  #  b  and  b  are  collectively  referred 
to  as  the  "model  parameters."  ” 

The  test  cases  considered  were  as  follows: 

Test  Cases  1-4:  m®l  x#  value  of  SIG/SIGMA  varies 
from  low  to  high 

Test  Cases  5-8:  m=3  x's#  correlation  of  x's  varies 
from  none  to  high 

Test  Cases  9-12:  ms6  x's#  correlation  of  x's  from 
none  to  high 

Test  Cases  13-16:  m=10  x's#  correlation  of  x's  from 
none  to  high 

The  number  (m)  of  independent  variables  (x's)  represents  the 
"dimensionality"  of  the  estimation  problem.  In  the 
classical  least-squares  approach#  it  is  necessary  to  invert 
a  matrix  of  order  m  in  order  to  estimate  the  model 
parameters . 

In  social  science  applications#  the  dimensionality  of  a 
linear  regression  model  can  be  quite  high#  e.g.#  m  =  25  to 
50  (e.g.#  there  can  be  a  model  coefficient  corresponding  to 
every  possible  response  to  a  socioeconomic  or  demographic 
question  included  in  a  survey  questionnaire).  In  industrial 
and  scientific  applications#  the  number  of  explanatory 
variables  may  vary  from  small  to  large#  but  the  researcher 
is  often  able  to  specify  the  values  of  the  independent 
variables#  so  that  even  if  there  are  many  of  them,  special 
procedures  (e.g.,  fractional  factorial  experimental  designs) 
can  be  used  to  avoid  the  explicit  inversion  of  a  matrix.  In 
military  applications#  the  value  of  m  is  often  moderate  or 
small.  For  example#  in  the  application  of  tracking  a 
missile#  the  position#  velocity#  and  acceleration  of  the 
missile  can  be  specified  by  a  nine-component  state  vector# 
and  the  estimation  of  the  parameters  of  the  statistical 
model  (called  a  Kalman  filter)  requires  inversion  of  a  nine 
by  nine  matrix.  The  test  cases  specified  above  include  the 
usual  dimensionality  range  of  interest  for  many  military 
applications.  For  all  test  cases#  the  number  of 
observations  generated  (i.e.#  the  sample  size)  was  n  =  100. 
Later  study  should  examine  the  effect  of  varying  sample 
size;  with  the  resources  available  to  this  project#  it  was 


I 

I 

I 

I 


not  possible  to  exaunine  a  very  large  number  of  test  cases / 
and  the  decision  was  made  to  hold  sample  size  constant  for 
all  test  cases. 

A  computer  program,  called  SIMULA,  was  written  to  implement 
the  generation  of  the  test  case  data.  A  source  code  listing 
of  that  progr2un  is  presented  in  Appendix  A. 

Task  3.  Implementation  of  the  Single-Variable  Linear-Model 
Case 

In  the  1940's  and  1950's,  some  work  was  done  in 
investigating  model  estimation  procedures  that  were 
alternatives  to  the  classical  least-squares  procedure.  Two 
of  these  procedures  are  referred  to  as  the  "Wald"  method  and 
the  "Bartlett"  method.  They  were  designed  for  application 
to  the  case  in  which  there  was  a  single  explanatory  variable 
(i.e.,  m  =  1).  A  computer  program  was  written  in  this 
project  to  implement  both  of  these  procedures. 


y;  As  one  of  the  first  steps  in  this  study,  it  was  proposed  to 

j[g  compare  the  performance  of  the  Wald  and  Bartlett  estimation 

procedures  to  the  performance  of  the  classical  least  squares 
method,  in  order  to  illustrate  the  utility  of  the  criteria 
y!  and  associated  performance  measures  proposed  to  compare 

alternative  estimation  procedures. 
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This  comparison  demonstrated  that  several  of  the  suggested 
criteria  could  be  applied  to  measure  algorithm  performance. 
For  the  single-explanatory-variable  example,  however,  the 
comparison  is  not  very  revealing.  The  estimation  of 
parameters  for  single-explanatory-variable  (m  ®  1)  models  is 
very  easy  and  fast  with  any  of  the  methods,  so  that  there  is 
almost  no  variation  in  the  performance  measures. 

It  was  not  possible  to  implement  all  of  the  performance 
measures  in  this  project,  because  of  resource  limitations. 
For  example,  one  problem  that  arose  was  that,  for  the 
microcomputer  software  used  in  this  study,  the  system  timer 
could  not  be  accessed  by  the  FORTRAN  compiler.  It  was 
decided  not  to  allocate  project  resources  to  the  development 
of  an  assembly-language  timer  that  could  be  linked  to  the 
FORTR  AN  -  comp  i  1  e  d  object  modules.  Instead,  timing 
measurements  were  made  externally  (manually,  by  direct 
visual  observation),  and  they  are  hence  approximate.  For 
other  performance  measures  (e.g.,  the  parameter  accuracy 
measures),  it  would  have  been  necessary  to  replicate  a  large 
number  of  sample  cases  to  determine  numerical  estimates  of 
these  measures.  Once  again,  project  resources  were  not 
sufficiently  ample  to  accomplish  this.  Although  it  was  not 


« 


mmmm 


I 

I 

I 


§ 


£ 


I 

S 

V. 


%'• 

S' 

^  • 


possible  to  implement  all  of  the  suggested  performance 
measures  in  this  Phase  I  effort,  it  is  quite  feasible  to  do 
so  with  some  additional  resources,  and  this  should  be  done 
as  part  of  Phase  II,  if  Phase  II  is  funded. 

Task  4.  Extension  to  the  Multiple-Variable  Linear-Model 

Task  4  was  the  central  task  of  the  proposed  study.  The 
objective  of  this  task  was  to  determine  fast  algorithms  for 
estimating  parameters  of  models  containing  more  than  one 
explanatory  variable.  In  this  project,  we  sythesized  and 
analyzed  a  number  of  algorithms  that  represent  extensions  of 
the  Wald-Bartlett  methods.  They  are  iterative  methods,  and 
will  be  referred  to  as  "iterative  Wald-Bartlett"  methods. 
Several  variations  of  this  method  were  considered,  and  the 
one  that  worked  best  was  selected  for  detailed  examination. 
This  method  is  described  in  detail  in  this  report,  and  all 
of  the  performance  assessments  that  are  presented  in  this 
report  are  for  this  method. 

The  thrust  of  this  project  was  to  compare  the  performance  of 
new  estimation  techniques  to  the  performance  of  the 
classical  least -squares  technique.  In  order  to  do  this,  a 
computer  program  was  required  that  could  perform  the 
classical  least-squares  computations.  We  implemented  the 
least-squares  estimation  procedure  by  using  the  Gauss-Jordan 
method  of  solving  the  normal  equations  (i.e.,  inverting  the 
correlation  matrix).  This  procedure  is  described  in  Chapter 
3  of  Cooley  and  Lohnes,  Multivariate  Procedures  for  the 
Behavioral  Sciences  (Reference  8),  and  is  the  basis  for  the 
least-squares  estimation  procedure  presented  in  the  IBM 
Scientific  Subroutine  Package  (Reference  9).  The 
least -squares  algorithm  used  in  this  project  was  based  on 
subroutines  available  from  the  IBM  Scientific  Subroutine 
Package,  adapted  to  the  Microsoft  (R)  FORTRAN  compiler  that 
was  available  to  the  microcomputer  used  in  this  project. 
Thorough  documentation  of  the  subroutines,  including 
commented  code,  is  included  in  Reference  9.  A  source  code 
listing  of  the  algorithm  used  in  this  project  is  presented 
in  Appendix  A.  The  listing  in  Appendix  A  does  not  include 
any  comments  in  the  subroutines,  in  order  to  avoid  possible 
copyright  infringement. 

The  computer  program  source  code  for  the  iterative 
Wald-Bartlett  method  and  the  classical  least-squares  method 
is  also  presented  in  Appendix  A.  These  programs  are  written 
in  FORTRAN  II,  an  early,  unstructured  version  of  FORTRAN. 
That  version  was  used  since  it  was  the  version  implemented 
by  the  Microsoft  FORTRAN  compiler  used  on  this  project. 
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Task  5.  Comparison  of  Methods 


Task  5  was  concerned  with  comparison  of  the  performance  of 
the  iterative  Wald-Bart lett  method  and  the  classical 
least -squares  method,  applied  to  estimate  the  parameters  of 
the  sixteen  test  cases  developed  in  Task  2.  The  results 
were  very  interesting. 

First,  the  performance  of  both  the  iterative  Wald-Bartlett 
and  the  classical  least-squares  method  depends  on  the  nature 
of  the  problem.  For  "easy"  problems,  in  which  there  are  few 
x's  or  they  are  uncorrelated,  both  methods  work  well. 
Second,  in  problems  of  low  to  moderate  difficulty,  there 
does  not  appear  to  be  an  appreciable  speed  difference 
between  the  classical  least-squares  algorithm  and  the 
alternative  algorithm  synthesized  in  this  study. 

Third,  both  the  classical  least-squares  and  the  iterative 
Wald-Bartlett  methods  have  difficulties  with  very  difficult 
problems  (m  large  and  the  x's  highly  correlated).  The 
really  significant  result  that  was  observed  in  this  case  was 
that,  whereas  the  classical  method  may  fail 
catastrophically,  producing  totally -absurd  results,  the 
iterative  Wald-Bartlett  method  is  not  particularly  fast,  but 
it  determines  an  estimated  model  that  produces  reasonably 
close  predictions.  (Note:  The  original  number  of  test  cases 
planned  to  be  examined  was  12.  It  was  after  observing  this 
result  that  we  added  four  more  test  cases,  representing 
singular  covariance  matrices,  to  examine  this  phenomenon  in 
greater  detail.) 

The  implications  of  this  result  are  very  significant,  and 
probably  outweighs  the  importance  of  speed  in  most 
applications,  particularly  since  the  processing  speed 
differences  between  the  iterative  Wald-Bartlett  and  the 
classical  least  squares  algorithm  are  not  great.  For 
example,  in  an  embedded-computer  application  (e.g.,  a 
tracker  on  an  unmanned  missile)  it  may  be  very  desirable  to 
have  a  "robust"  estimation  or  prediction  procedure  —  one 
that  does  not  fail  catastrophically.  Moreover,  the  problem 
causing  the  catastrophic  failure  of  the  least-squares  method 
(the  failure  of  the  algorithm  to  be  able  to  invert  a 
"near ly -si ngular "  matrix)  is  one  that  has  plagued  data 
analysts  for  years  —  ever  since  the  widespread  use  of 
digital  computers  to  implement  the  least-squares 
methodology.  Matrix  inversion  problems  in  statistical 
analysis  were  not  severe  or  widespread  prior  to  the  1960 's, 
when  most  computations  were  done  using  mechanical 
calculators.  Most  statistical  calculations  were  done  by 
masters -level  mathematicians,  and  the  problems  were  kept 
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small  or  designed  to  avoid  the  inversion  of  large  matrices. 
Starting  in  the  1960 's,  however,  computer  packages  and 
computing  resources  were  widely  available,  and  were  being 
applied  in  many  cases  by  researchers  who  had  little  or  no 
appreciation  of  the  matrix-inversion  problem  inherent  in  the 
1 e a s t - sq ua r e s  method.  With  the  large  number  of 
non-statisticians  using  statistical  linear-model  packages 
(e.g.,  multiple  regression  packages)  and  the  low  precision 
of  many  microcomputers,  this  problem  arises  frequently. 
Consulting  statisticians  are  often  called  in  on  regression 
analysis  investigations  when  regression  analysis  packages 
produce  meaningless  results  due  to  a  matrix-inversion 
problem  caused  by  a  near-singularity  in  a  correlation  matrix 
or  a  linear  dependency  in  the  x*s.  With  the  growing  number 
of  microcomputers,  and  the  increasing  number  of 
non-statisticians  using  regression  analysis  programs  (and 
the  many  other  statistical  procedures  that  require  matrix 
inversion),  there  is  a  need  for  statistical  estimation 
procedures  that  do  not  fail  when  near-singularities  or 
linear  dependencies  are  present  in  the  data.  The  demand  for 
"robust"  statistical  estimators  probably  represents  a  far 
greater  commercial  value  than  the  demand  for  high-speed 
algorithms,  since  the  demand  for  real-time  processing 
represents  only  a  small  portion  of  the  total  demand  for 
statistical  estimation.  This  aspect  should  be  explored 
further  in  Phase  II. 

In  general.  Task  5  succeeded  in  demonstrating  the 
feasibility  of  the  proposed  approach.  Although  the 
iterative  Wald-Bart lett  method  does  not  appear  to  be 
substantially  faster  than  the  classical  least-squares 
method,  it  appears  to  be  much  more  "robust"  than  the 
classical  least-squares  method.  Also,  since  the  iterative 
Wald-Bart  let  t  method  is  based  on  "order  statistics,"  it 
would  be  much  less  sensitive  to  data  errors,  such  as 
"outliers,"  than  the  classical  method.  The  results  of  the 
Phase  I  study  suggest  that  additional  exploration  of  this 
area  would  be  very  beneficial. 

Task  6.  Design  of  Phase  II  Work  Plan 

The  results  of  this  Phase  I  study  have  demonstrated  the 
feasibility  of  determining  fast,  robust  estimators.  This 
study  centered  on  the  synthesis  of  a  particular  class  of 
estimator,  however,  and  that  estimator  certainly  does  not 
represent  a  final  solution  to  the  general  problem.  Although 
the  iterative  method  considered  in  this  study  does  not  fail 
catastrophically  in  difficult  or  ill-conditioned  problems, 
as  does  the  classical  least-squares  method,  it  is  not  very 
fast  in  such  cases.  The  present  feasibility  study  has 
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revealed  the  promise  of  the  proposed  approach,  but 
substantial  development  effort  is  necessary  to  produce  an 
estimation  method  that  works  well  (i.e.,  is  both  fast  and 
robust)  in  all  cases.  We  believe  that  the  resources 
available  in  Phase  II  can  accomplish  that  development 
effort,  and  have  outlined  a  research  plan  for  implementing 
the  effort. 


It  is  proposed  that  the  Phase  II  effort  be  redirected  from 
the  goal  originally  proposed  for  Phase  I.  The  Phase  I 
research  was  directed  solely  to  the  problem  of  finding  fast 
algorithms.  Based  on  the  Phase  I  results,  however,  it 
appears  that  there  may  be  more  potential  (both  in  terms  of 
project  success  and  commercial  value)  in  attempting  the 
development  of  robust  algorithms.  The  serendipitous 
discovery  that  the  synthesized  iterative  method  could 
produce  solutions  to  problems  for  which  the  classical 
least-squares  method  failed  catastrophically  probably 
outweighs  the  promise  of  a  fast  algorithm,  in  terms  of  both 
military  and  commercial/industrial  significance.  Embedded 
processors  in  military  weapon  systems  require  software  that 
is  robust,  i.e.,  does  not  fail  catastrophically  under 
certain  circumstances.  The  current  effort  has  demonstrated 
that  it  is  indeed  possible  to  develop  such  estimators.  The 
growing  use  of  microcomputers  in  commercial  and  industrial 
applications  will  create  a  growing  demand  for  estimation 
procedures  that  do  not  fail  in  ill-conditioned  problems,  and 
do  not  require  the  participation  of  a  professional 
statistician  to  assure  their  convergence  to  a  correct 
solution.  These  applications  include  not  only  multiple 
linear  regression  models  of  the  sort  addressed  in  this 
study,  but  the  whole  range  of  modern-day  data  analysis 
procedures  (multivariate  analysis  of  variance,  factor 
analysis,  canonical  correlation  analysis,  discriminant 
analysis,  and  time  series  analysis  models),  since  all  of 
these  methods  involve  matrix  inversion,  which  is  the  source 
of  the  slowness  and  potential  for  catastrophic  failure  of 
these  methods.  The  availability  of  robust  algorithms  would 
be  particularly  beneficial  in  time  series  applications 
(e.g.,  "Box-Jenkins"  analysis),  where  iterative  estimation 
methods  are  often  employed,  and  convergence  problems  are 
encountered  by  non-statistician  users. 

It  is  proposed  that  Phase  II  be  a  two-year  effort,  staffed 
at  approximately  2-5  persons  per  year.  A  description  of  the 
proposed  tasks  to  be  addressed  in  the  Phase  II  effort  is 
included  in  this  report. 

Task  7.  Preparation  of  Final  Report 


This  document  describes  the  activities,  results,  and 
conclusions  of  the  Phase  I  study.  In  addition,  it 
identifies  the  effort  that  should  be  implemented  in  Phase 
II,  in  order  to  complete  the  development  of  fast,  robust 
algorithms  for  estimation,  prediction,  and  control. 

This  Phase  I  study  has  accomplished  each  of  the  seven  tasks 
identified  and  described  in  the  proposal,  in  the  proposed 
six-month  time  frame.  We  believe  that  our  success  in 
accomplishing  all  of  the  proposed  tasks,  on  schedule,  augurs 
well  for  the  accomplishment  of  the  Phase  II  objectives  as 
well . 

C.  Orqanization  of  the  Report 


The  remainder  of  this  report  consists  of  four  additional 
sections  and  two  appendices.  Section  II  (“Background") 
describes  the  motivation  for  the  proposed  research  effort, 
both  relative  to  the  requirement  for  fast  algorithms  and  for 
robust  algorithms.  Section  III  ("Project  Approach") 
describes  the  methodology  for  conducting  the  Phase  I 
investigation.  The  methodology  consists  of  the 
specification  of  performance  criteria,  the  simulation  of 
test-case  data  with  which  to  test  the  performance  of 
alternative  algorithms,  the  synthesis  of  one  or  more 
candidate  "fast  algorithms,"  and  the  comparison  of  the 
performance  of  the  synthesized  method  to  the  classical 
least-squares  method  using  the  specified  criteria  and  the 
test-case  data.  Section  IV  ("Simulation  Results")  presents 
the  results  of  this  simulation  study.  Section  V 
("Conclusions  and  Recommendations")  summarizes  the  study 
conclusions  and  describes  the  additional  research  and 
development  thata  is  needed  to  develop  and  implement  the 
proposed  concept.  The  Appendices  contain  source  code 
listings  of  all  computer  programs  used  in  this  study 
(Appendix  A),  and  detailed  output  listings  (Appendix  B) . 
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II.  BACKGROUND 

A.  General  Motivation  for  the  Proposed  Study 

This  report  describes  the  results  of  a  research  study  to 
assess  the  feasibility  of  developing  fast  algorithms  for 
real-time  estimation*  prediction  and  control.  Such 
algorithms  would  provide  a  solution  to  a  critical  problem 
faced  in  both  industrial  and  military  applications  —  the 
fact  that  the  algorithms  used  to  implement  state-of-the-art 
statistical  estimation,  prediction  and  control  techniques 
are  far  too  slow  for  many  real-time  or  near-real-time 
applications  of  high  interest,  even  using  the  fastest 
computers . 

The  slowness  of  statistical  correlation/tracking  techniques 
such  as  the  Kalman-Bucy  filter  was  one  of  the  principal 
reasons  for  the  failure  of  the  ballistic  missile  defense 
program  of  the  1960 's.  This  problem  has  still  not  been 
satisfactorily  solved,  even  with  substantial  improvements  in 
computer  processing  speed  and  direct-access  storage 
capability.  Modern  command,  control,  communications,  and 
intelligence  (C^i)  systems  such  as  those  intended  to 
support  the  AirLand  Battle  concept,  the  Air  Force’s  Tactical 
Air  Control  Center,  or  Naval  tactical  data  systems  require 
multi  sensor  correlation/fusion  to  be  performed  in  real  time 
or  near  real  time.  The  success  of  such  systems  is  in 
jeopardy  to  the  extent  that  they  rely  on  data  processing  by 
traditional  statistical  algorithms. 

In  addition  to  military  applications,  the  availability  of 
fast  prediction  and  control  algorithms  would  assist  control 
of  rapid,  time-varying  processes  occurring  in  industry  and 
commerce  (e.g.,  hot  steel  finishing  mills  and  air  traffic 
control),  for  which  available  prediction  and  control  methods 
are  generally  heuristic,  because  of  the  current  inability  to 
conduct  on-line  system  identification  (and  thereby  use 
model-based  predictors/controllers) . 

B.  The  Requirement  for  Fast  Prediction/Control  Algorithms 

Over  the  past  two  decades,  tremendous  advances  have  been 
made  in  the  development  of  powerful  statistical  estimation 
techniques.  The  applications  associated  with  these 
techniques  cover  a  wide  variety  of  fields,  including 
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scientific,  economic,  industrial,  and  military  applications. 
In  general,  present-day  statistical  estimation  procedures 
represent  extensions  of  the  work  of  Gauss,  who  developed  the 
least-squares  method  of  estimating  the  position  of  an 
asteroid,  based  on  observations  which  contained  measurement 
errors.  Gauss's  method,  or  the  method  of  least  squares, 
consists  in  determining  a  set  of  model  parameters  in  such  a 
way  that  the  sum  of  squares  of  the  differences  between  the 
actual  measurements  and  the  position  estimates  based  on  the 
model  is  minimized. 

A  salient  characteristic  of  the  method  of  least  squares  is 
that  it  requires  computation  of  a  "cross-products"  matrix 
(or  a  covariance  matrix  or  a  correlation  matrix),  and  it 
requires  the  inversion  of  a  matrix  whose  order  depends  on 
the  number  of  parameters  being  estimated.  In  many 
applications,  this  presents  no  problems,  since  statistical 
analysis  need  not  be  done  in  "real-time."  Instead,  in  most 
applications  the  analyst  may  collect  the  observed  data,  and 
then  determine  the  parameter  estimates  "off-line"  in  the 
hours,  days,  or  weeks  that  follow.  In  some  applications, 
however,  such  as  those  involving  the  tracking  of  fast-moving 
obj  ects  (such  as  missiles,  airplanes,  or  satellites),  or  the 
control  of  rapidly-changing  systems  (such  as  certain 
industrial  processes),  it  is  necessary  to  estimate  the 
parameters  "on-line,"  in  real-time  or  near-real-time.  In 
problems  in  which  attention  centers  on  only  one  or  a  few 
processes  at  a  time  (e.g.,  a  small  number  of  objects  are 
being  tracked,  or  a  small  number  of  electromagnetic  emitters 
are  being  monitored),  the  computational  burden  is  not 
severe.  If  numerous  tracks  (or  emitters)  are  involved,  or 
the  underlying  system  changes  "too  fast,"  the  method  of 
least-squares  breaks  down  —  the  computational  requirements 
may  saturate  even  the  most  powerful  (fastest,  largest) 
computers  available. 

In  an  attempt  to  remedy  this  problem,  a  tremendous  amount  of 
effort  has  been  expended  on  the  development  of 
computationally  efficient  algorithms  for  determining 
least -squares  estimates.  In  1960,  Kalman  and  Bucy  developed 
a  recursive  solution  to  the  least-squares  estimation 
problem,  now  called  the  Kalman  filter  (or  Kalman-Bucy 
filter).  Kailath  (Reference  1)  provided  a  comprehensive 
survey  of  over  600  references  on  filtering.  Aggarwal 
(Reference  2)  describes  the  problem  of  developing 
least-squares  algorithms  that  are  numerically  stable  and 
computationally  efficient. 

One  approach  to  reducing  the  computational  complexity  of 
tracking  algorithms  is  linearization.  While  this  procedure 
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helps,  it  is  not  sufficient.  As  an  example  of  the 
inadequacy  of  linearization,  it  is  noted  that  the  ballistic 
missile  trackers  proposed  in  the  1960 's  were  in  fact 
simplified  nine-component  Kalman  filters  in  which  the 
equations  of  motion  had  been  linearized,  and  the  covariance 
matrix  radically  simplified.  Yet  this  approach  still 
failed,  because  of  the  tremendous  computational  requirements 
of  the  general  linear  model  and  the  least-squares  estimates. 

As  a  further  example  of  the  inadequacy  of  linearization,  it 
is  noted  that  neither  the  Kalman  filter  nor  the  Box-Jenkins 
( autoregressi ve-i ntegra ted-roovi ng-a verage  time  series) 
models  could  outperform  the  heuristic  alpha-beta  tracker,  in 
air  traffic  control  studies  of  the  early  1970 's.  As  a  final 
example,  it  appears  that  in  complex  correlation/tracking 
problems  (e.g.,  satellite  ocean  surveillance,  intelligence 
analysis  of  electromagnetic  emissions  for  unit 
identification),  heuristic  nonlinear  "algorithmic" 
procedures  work  better  than  procedures  derived  from 
linear-model  theory. 

The  potential  exists  to  reduce  the  computational  burden  of 
prediction/control  algorithms  by  a  factor  of  several  orders 
of  magnitude.  Furthermore,  this  reduction  can  probably  be 
achieved  at  very  modest  cost  --  a  few  person-years  of 
research  effort.  This  investment  is  negligible,  when 
compared  to  the  massive  research  investment  that  will  be 
required  to  develop  even  a  hundred-fold  increase  in  computer 
speed,  through  the  development  of  an  operational  large-scale 
parallel-processor  or  bio-chip  technology. 

In  addition  to  improvements  in  the  computational  efficiency 
of  least-squares  algorithms,  tremendous  gains  have  been  made 
in  the  speed  of  the  computers  which  perform  the 
1  e  a  s  t  -  s  q  u  a  r  e  s  computations.  Advances  such  as  Very 
Large-Scale  Integration  (VLSI)  technology  have  increased 
computer  processing  speeds  and  di rect -access  storage 
capabilities  by  a  factor  of  one  thousand  since  the 
mid-1960's,  when  the  feasibility  of  performing  least-squares 
tracking  of  incoming  ballistic  missiles  was  seen  to  exceed 
available  computational  capabilities. 

Despite  the  tremendous  computational  gains  of  the  last 
fifteen  years,  however,  it  appears  that  further  advances  may 
be  elusive.  Computer  processing  technology  is  now  running 
up  against  physical  limits,  such  as  the  time  required  for 
electronic  signals  to  propagate  along  the  wires  inside  the 
computer.  As  reported  in  a  recent  issue  of  Defense 
Science  (Reference  3),  advances  in  computational  speed  are 
leveling  off  (see  Figure  1).  In  order  to  achieve  processing 
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Figure  1. 


The  Exponential  Rate  of  Growth  in  Computer 
Processing  Speed  is  Slowing 
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speed  increases  of  two  orders  of  magnitude  or  more  with 
physical  devices,  it  appears  that  parallel  processing 
architectures  will  be  required.  Unfortunately,  formidable 
problems  are  associated  with  the  development  of  large-scale 
parallel  processors  and  associated  software,  and  a 
substantial  amount  of  basic  research  and  development  will  be 
required. 

To  achieve  further  increases  will  probably  require  the 
development  of  "bio-chips,"  in  which  synthetic  organic 
molecules  perform  the  binary  switching  functions  of 
present-day  physical  switches  such  as  silicon  or  gallium 
arsenide  field-effect  transistors.  In  principle,  the  use  of 
bio-chip  molecular  switches  could  lead  to  circuit  elements 
one  thousand  times  smaller  than  may  be  achieved  by 
conventional  semiconductors.  Developments  in  this  area  may 
be  slow,  however,  since  even  the  most  promising  of  the 
bio-chips  —  the  soli ton  switch  —  is  still  theoretical. 

In  summary,  it  does  not  appear  likely  to  significantly 
improve  the  computational  speed  of  least-squares  algorithms, 
and  the  likelihood  of  realizing  substantial  increases  in 
computer  speed  very  soon  is  not  high.  In  order  to  achieve 
substantial  gains  in  processing  speed  in  the  near  term, 
then,  it  appears  that  the  most  promising  approach  is  to 
develop  estimation  procedures  which  impose  a  substantially 
reduced  computational  burden. 

C .  Early  Accomplishments  in  the  Area  of  Low-Compute 
Estimation  Procedures 


1 .  Experimental  Design  Applications 

In  the  1930 's  and  1940's,  tremendous  advances  were  made  in 
the  development  and  application  of  the  general  linear 
statistical  model,  to  solving  problems  in  statistical 
experimental  design.  In  those  days,  however,  no  computers 
(other  than  human  beings)  were  available  for  solving 
large-scale  systems  of  linear  equations,  and  ingenious 
methods  were  developed  to  determine  designs  for  which  the 
estimators  could  be  determined  without  the  need  for  explicit 
matrix  inversion. 

The  advantage  in  the  field  of  experimental  design,  of 
course,  is  that  the  statistician  has  control  over  the  values 
of  the  explanatory  variables  of  the  model.  The  popular 
experimental  designs  developed  in  the  1930's  and  1940's 
(randomized  blocks,  Latin  squares,  fractional  factorial, 
par t i al ly -ba lanced  incomplete  blocks)  were  models  in  which 
the  designer  introduced  various  degrees  of  orthogonality 


into  the  "design  matrix"  of  the  model »  so  that  the  equations 
of  estimation  could  be  easily  solved. 


The  point  to  be  recognized  here  is  that/  in  the  face  of  a 
strong  requirement  to  develop  low-compute  estimation 
procedures  in  experimental-design  applications/  trenendous 
advances  were  made.  The  underlying  theory  was  ccanplex 
(Galois  theory  and  projective  geometries)  /  but  the 
computational  algorithms  that  resulted  were  extremely 
simple/  allowing  for  the  rapid  hand-solution  of  estimation 
problems  containing  large  numbers  of  variables. 

2.  Regression-Model  Applications 

In  regression  analysis/  the  statistician  does  not  always 
have  control  over  the  values  of  the  explanatory  variables/ 
as  is  the  case  in  experimental  design.  Nevertheless/  in  the 
period  1920-1950  significant  advances  were  made  in  the  field 
of  developing  low-compute  procedures  for  soving  regression 
problems .  The  Biometrika  Tables  for  Statisticians/  which 
were  published  in  1953  to  "reduce  the  labour  of  statistical 
arithmetic/"  included  tables  of  orthogonal  polynomials/ 
which  vastly  reduced  the  amount  of  computation  required  to 
produce  estimates  of  linear  contrasts.  These  tables  were  of 
invaluable  aid  to  statisticians  in  computing  estimates  of 
regression  model  parameters  until  about  1960/  when 
high-speed  digital  computers  became  generally  available  in 
research  facilities.  At  that  time,  it  appears  that  just 
about  all  effort  directed  toward  computational  simplicity 
ceased.  A  digital  computer  could/  in  minutes /  invert  large 
matrices  that  simply  could  not  be  inverted  manually. 

It  is  interesting  to  note  that  it  was  at  just  about  this 
same  time  that  Kalman  and  Bucy  developed  their  recursive 
scheme  for  determining  estimates  and  predictions  for  a 
linear  time-series  model.  The  computational  requirements  of 
their  method  were  staggering  from  a  manual  perspective/  but 
offered  no  difficulties  when  tackled  by  a  digital  computer. 
Over  the  next  twenty  years/  advances  were  made  in  the 
development  of  more  rapid  or  more  precise  algorithms  for 
implementing  the  Kal.  -Bucy  filter/  but  these  methods 
accepted  the  basic  linear-model-estimation  formulas  of  the 
recursive  filter  as  a  starting  point.  With  this  mind-set/ 
the  computational  requirements  of  the  Kalman-Bucy  filter 
were  never  substantially  reduced. 

In  a  serendipitous  fashion/  however/  it  is  interesting  to 
observe  that  some  advances  were  inadvertently  made  in  the 
development  of  low-compute  estimators  for  the 
n  o  n  -  t  i  m  e  -  s  e  r  i  e  s  linear  (regression)  model.  These 


development s ,  by  Wald  and  Bartlett,  are  described  in  the 
paragraphs  that  follow. 

In  its  simplest  form,  the  general  linear  statistical  model 
(which  forms  the  basis  for  modern  estimation,  prediction, 
and  control  algorithms)  may  be  written  as: 

^  «  X'£  +  e, 

where 

^  =  vector  of  observations; 

2  =  vector  of  parameters; 

X  =  matrix  of  explantory  variables  ("data  matrix"); 
e  =  error  vector, 

and  the  prime  (')  denotes  matrix  transposition.  In 
cor relati on/ tracki ng  and  fusion  problems,  the  form  of  the 
equations  changes  somewhat  (e.g.,  there  are  "model"  and 
"observation"  errors,  and  the  representation  is  usually  in 
terms  of  a  state  vector),  but  the  elementary  form  given 
above  will  serve  to  illustrate  the  nature  of  the  estimation 
algorithms. 

The  least -squares  estimate  of  the  parameter  £  is  given  by: 
£  «  (XX')*Xji 

where  the  asterisk  denotes  a  conditional  (generalized) 
inverse  of  a  matrix.  The  key  point  to  note  with  the 
least-squares  estimate  is  the  fact  that  it  involves  matrix 
products  and  matrix  inversion.  (As  the  model  becomes  more 
complex,  the  number  of  matrix  operations  increases.) 

In  the  simple  example  of  regression  analysis  (fitting  a 
straight  line),  the  above  model  reduces  to: 

^i  =  Pi  +  P2Xi  + 

where  the  index  i  denotes  the  i-th  observation  (i  = 
1 , 2  ,  .  .  .  ,  n )  ,  and  the  least -squares  estimates  of  the 
parameters  (the  intercept  and  slope  of  the  line)  are: 
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Pi  =  y  -  P2X 

where  x  and  y  denote  the  means  of  the  observed  x's  and  y's, 
respectively. 

In  1940,  Abraham  Wald  (the  "father"  of  statistical  decision 
theory  and  sequential  analysis)  proposed  (Reference  5)  a 
much  simpler  estimator  as  an  alternative  to  P2: 

P2  =  (y2  -  yi)/(X2  -  XI ) 

where  Xi  <Jenote  the  means  of  the  x-values  above 

and  below  the  median  (of  the  x's)  and  7i  and  yo  denote 
the  means  of  the  corresponding  y-values  (i.e. ,  the  y-values 
associated  with  the  x-values).  Wald  originally  proposed 
this  estimator  as  a  solution  to  the  problem  in  which  both 
the  X  variable  and  the  y  variable  are  subject  to  error.  It 
is  interesting  to  observe,  however,  that  Wald's  estimate 
requires  substantially  less  computation  than  the 
least-squares  estimator  --  4n  additions  and  one  division 
versus  4n  additions,  2n  multiplications  and  one  division, 
where  n  denotes  the  number  of  observations.  This  represents 
a  reduction  in  computer  time  by  an  order  of  magnitude. 
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Wald's  estimator  possesses  the  desirable  statistical 
property  of  consistency  (which  the  least  squares  estimate 
does  not,  in  the  errors-in-variables  problem),  but  the 
sampling  variance  of  the  estimator  is  larger  than  for  the 
least -squares  estimate.  This  inefficiency  may  be  overcome 
by  taking  a  slightly  larger  sample,  in  which  case  the  Wald 
estimate  still  has  the  computational  advantage. 

In  1949,  M.  S.  Bartlett  (Reference  6)  modified  Wald's 
estimator  by  dividing  the  ranked  x-variable  into  three 
equal-sized  groups,  and  forming  the  estimate 

P2  =  (y3  -  yi)/(X3  -  xj^) 

where  *  ^  y.  are  the  means  co^r  r  e  s  p  o  n  d  i  ng  to  the 

low-value  group  of  x's,  and  x-,  y-  are  the  means 
corresponding  to  the  high-value  group  of'^x's.  Bartlett's 
estimator  is  more  efficient  (i.e.,  has  lower  sampling 
variance)  than  Wald's  estimator,  and  requires  1/3  less 
computation . 
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In  1958  ,  J.  W.  Hooper  and  H.  Theil  (Reference  7)  extended 
the  Wald/Bartlett  method  of  grouping  to  the  case  of  multiple 
linear  regression  (in  which  there  is  more  than  one 
x-variable).  The  method  was  judged  somewhat  tedious  to 
implement,  however,  and  was  essentially  abandoned.  Note 
that  this  was  about  the  time  when  high-speed  digital 
computers  (e.g.,  the  IBM  650)  were  becoming  generally 
available  (at  least  in  the  major  universities  and  research 
centers),  and  so  there  was  at  that  time  no  longer  an 
incentive  to  prefer  the  Wald-type  estimators  to  the 
least -squares  estimators  on  computational  grounds.  (For  the 
errors-in-variables  problem,  for  which  the  Wald's  estimator 
was  orginally  developed,  a  method  by  J.  Durbin,  introduced 
in  1954,  was  generally  adopted  as  a  perf erred  method.  It  is 
more  (statistically)  efficient  than  the  Wald  and  Bartlett 
estimators,  but  is  not  relevant  to  the  present  problem 
because  it  requires  the  same  amount  of  computation  as  the 
least-squares  estimation  procedure.) 

It  is  interesting  to  observe  that  both  the  Wald  and  Bartlett 
estimators  may  be  derived  from  the  formula  for  the 
least-squares  estimates,  by  replacing  the  values  of  the 
explanatory  variables  by  +l's  and  -I's  in  the  case  of  the 
Wald  estimator,  and  by  +l's,  -I's,  and  O's  in  the  case  of 
the  Bartlett  estimator.  This  procedure  is  analogous  to  the 
procedure  of  determining  an  experimental  design;  the 
x-values  are  set  at  values  which  enable  the  equations  of 
estimation  to  be  solved  without  explicit  matrix  inversion. 
(The  difference  is,  of  course,  that  in  the  case  of 
experimental  design,  the  specified  x-values  are  actually 
used  in  the  experimentation  process,  whereas  in  the 
regression  case,  the  actual  (continuous)  x-value  is  replaced 
by  the  simpler  discrete  value.) 

The  point  to  the  above  is  that,  with  a  little  ingenuity,  it 
is  possible  to  develop  estimators  that  have  drastically 
reduced  compute  ti  ona:..  requirements,  over  those  of  the 
least-squares  estimates.  The  preceding  example  addresses 
the  simplest  situation  (one  dependent  variable,  one 
explanatory  variable),  with  no  matrix  multiplications  or 
inversions  required,  and  yet  a  reduction  in  computational 
requirenents  of  an  order  of  magnitude  were  realized.  In  the 
general  case  of  several  or  many  variables  (e.g.,  a  Kalman 
filter  for  a  nine-component  state  vector),  involving  many 
matrix  multiplications  and  inversions,  the  potential  for 
dramatic  computational  reductions  is  tremendous. 

A  review  of  the  statistical  literature  of  the  past  two 
decades  reveals  a  fixation  with  least-squares  estimation. 
To  be  sure,  some  new  estimators  have  been  introduced  (e.g., 


jacknife  estimators),  but  they  are  computationally  similar 
to  the  1  e  a  s  t  -  s  q  ua  r  e  s  estimates  (requiring  matrix 
multiplications  and  inversions),  and  have  similar 
computational  efficiencies.  It  appears  that  the  effort  to 
improve  computational  efficiency  has  been  conditional  on  use 
of  the  least -squares  approach,  rather  than  on  centering  on 
novel  estimation  procedures.  The  criteria  against  which  the 
procedures  are  invariably  judged  is  the  error-variance  of 
the  minimum-variance  linear  unbiased  estimator.  While 
restriction  to  this  criterion  may  be  reasonable  for  off-line 
statistical  estimation,  it  is  not  reasonable  to  restrict 
attention  to  this  single  criterion  for  the  large-scale 
on-line  (real-time)  estimation  situation.  We  believe  that, 
once  the  criteria  against  which  the  estimator  are  to  be 
judged  are  appropriately  modified,  significant  and 
substantial  improvements  will  follow. 


Parallel  with  Optimization  Theory 


In  a  sense,  the  situation  with  respect  to  the  use  of  the 
general  linear  statistical  model  and  the  least-squares 
estimates  parallels  the  situation  that  existed  in  the  1950 's 
in  the  field  of  optimization  theory.  At  that  time,  the 
principal  optimization  procedure  was  linear  programming. 
The  framework  of  linear  programming  did  not  suit  many 
practical  applications,  however,  and  so  alternative 
nonlinear  programming  methods  were  sought.  In  1963,  the 
Generalized  Lagrange  Multipliers  method  was  introduced  by  H. 
E.  Everett.  This  powerful  method  produced  very  fast 
solutions  to  very  large  optimization  problems  in  which  the 
objective  function  could  be  nonlinear,  non-convex,  and 
discontinuous.  The  GLM  method  had  a  tremendous  advantage 
over  previous  optimization  procedures,  in  that  it  did  not 
require  the  solution  of  a  large-scale  system  of  equations. 
Unfortunately,  the  GLM  method  is  restricted  to  problems  in 
which  the  objective  function  is  "separable,"  i.e.,  may  be 
expressed  as  a  certain  sum.  Furthermore,  it  was  not 
possible  to  guarantee  convergence  of  the  method,  and  in  some 
cases  convergence  could  be  slow. 


In  the  late  sixties,  Fiacco  and  McCormick  promoted  the  use 
of  "quadratic  penalty  fuctions"  to  solve  constrained 
optimization  problems.  This  approach  (called  the  Sequential 
Unconstrained  Minimization  Technique,  or  SUMT)  worked  well 
for  a  larger  class  of  problems,  but  in  general  it  did  not 
possess  the  great  speed  of  the  GLM  method.  Also,  the  method 
fails  in  a  fairly  wide  class  of  problems  in  which  the 
Hessian  matrix  is  "ill-conditioned"  (not  positive  definite). 


Finally,  also  in  the  late  1960 's,  the  two  methods  were 
combined  into  what  is  now  known  as  the  "generalized 
Lagrangian  method."  Hestenes  and  Powell  suggested  adding  a 
quadratic  penalty  function  to  the  Lagrangian  function, 
instead  of  the  objective  function,  as  is  done  in  the  SUMT 
method.  The  generalized  Lagrangian  method  converges  fast, 
and  does  not  exhibit  the  ill-conditioning  that  frequently 
occurs  in  the  original  penalty-function  method  (SUMT). 

Thus,  in  a  span  of  about  ten  years,  a  tremendous  leap 
forward  was  made  in  solving  constrained  optimization 
problems.  The  interesting  fact  to  note,  however,  is  that 
whereas  the  linear  programming  solution  (the  simplex  method) 
had  a  very  well-behaved  theory  associated  with  it,  and  was 
guaranteed  to  converge  in  a  predetermined  number  of  steps, 
the  generalized  Lagrangian  methods  possessed  no  such 
property  --  the  methods  are  defined  as  algorithms  (for 
adjusting  the  values  of  Lagrange  multipliers),  and  no 
definitive  statement  can  be  made  about  the  rate  of 
convergence. 

The  fact  remains,  however,  that  by  moving  out  of  the  very 
restrictive  linear-model  framework,  tremendous  advances  were 
quickly  realized.  Furthermore,  satisfactory  solutions  were 
not  achieved  by  "linearizing"  nonlinear  problems,  but  by 
developing  heuristic  algorithmic  procedures,  which  were 
demonstrated  to  work  well. 

While  the  problems  of  estimation,  prediction,  and  control 
are  statistical  in  nature,  they  are  also  optimization 
problems,  and  it  is  reasonable  to  conjecture  that  a 
tremendous  advance  in  speed  may  be  realized  by  applying  the 
ingenuity  and  heuristic  methods  that  worked  so  well  in  the 
field  of  constrained  optimization.  The  current  situation  — 
a  near-total  dependence  on  the  general  linear  statistical 
model  —  is  analogous  to  the  situation  in  which  the  field  of 
constrained  optimization  theory  found  itself  twenty-five 
years  ago.  It  would  appear  that  much  can  be  done. 

F.  The  Need  for  a  Robust  Estimator 


During  the  course  of  this  Phase  I  study,  it  was  observed 
that  the  algorithm  that  had  been  synthesized  as  a  candidate 
"fast"  algorithm  possessed  a  remarkable  property  —  it 
produced  reasonable  results  when  applied  to  "difficult" 
problems,  in  which  the  classical  least-squares  method  failed 
catastrophically.  The  "difficult"  problems  were  ones  in 
which  the  correlation  matrix  that  had  to  be  inverted  in  the 
classical  least-squares  method  was  "near-singular"  (i.e., 
had  a  small  determinant).  This  situation  arises  whenever 
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the  explanatory  variables  of  the  model  are  highly 
correlated.  In  this  case,  inverting  the  correlation  matrix 
(which  is  central  to  the  classical  least-squares  method)  is 
difficult  (in  a  numerical  analysis  sense),  particularly  for 
large  or  even  moderately  large  matrices  (e.g.,  m  =  10 
explanatory  variables).  What  happens  is  that  roundoff 
errors  (due  to  the  truncated  representation  of  real  nximbers 
in  the  computer)  ruin  the  matrix  inversion,  and  the  matrix 
inversion  algorithm  fails  to  produce  the  desired  inverse. 
The  least -squares  method  in  fact  fails  "catastrophically," 
in  the  sense  that  the  produced  results  are  generally  totally 
wrong  --  the  resultant  model  produced  by  the  method  may 
predict  even  worse  than  the  "trivial"  model  that  predicts 
the  mean  value  of  the  dependent  variable  for  all  values  of 
the  independent  variables.  This  problem  is  ameliorated 
somewhat  by  using  double  precision  arithmetic,  but  it  is  a 
particularly  troublesome  problem  in  a  microcomputing 
environment,  where  the  computer  word  length  is  short  (and 
the  precision  of  computation  is  low).  The  problem  is  a 
particularly  insidious  one,  since  many  statistical  analysis 
programs  do  not  warn  the  user  that  the  method  has  failed, 
and  if  the  results  do  not  appear  to  be  patently  absurd,  they 
may  be  accepted  as  correct. 

In  addition  to  the  problem  of  near -singular  correlation 
matrices,  the  problem  of  catastrophic  failure  may  arise  if  a 
a  problem  is  "ill-conditioned,"  or  "ill-specified."  This 
happens,  for  example,  if  there  is  a  linear  dependency  among 
the  independent  variables  (the  x's).  This  situation  often 
arises  in  social  science  applications.  In  a  survey 
questionnaire,  for  example,  the  respondent  may  be  as)ced  to 
select  an  answer  in  one  of  a  number  (e.g.,  five)  of 
categories.  In  the  analysis  of  the  survey  data,  the 
response  for  the  selected  category  is  coded  as  a  "1,"  and 
the  response  for  the  other  categories  are  coded  as  "O's."  A 
social  science  researcher  who  is  not  aware  of  the  matrix 
inversion  to  be  done  in  a  regression  analysis  may  include 
all  five  responses  as  variables  in  the  data  base,  and  in  a 
regression  model.  These  variables  are  linearly  dependent, 
however,  since  the  sum  of  all  five  category  responses  must 
equal  1.  The  presence  of  this  linear  dependency  in  the 
dependent  variable  will  cause  the  correlation  matrix  to  be 
singular.  Once  again,  the  least-squares  method,  if  applied 
to  this  problem,  will  fail  catastrophically.  This  case  is 
often  not  as  troublesome  as  the  "near-singular"  case, 
however,  since  the  presence  of  an  exact  linear  dependency 
may  result  in  a  computed  matrix  determinant  of  exactly  zero, 
and  some  computer  programs  check  for  this  condition. 
Because  of  roundoff  errors,  however,  the  computer  algorithm 
may  fail  to  recognize  the  singularity,  compute  a  nonzero 


numerical  value  for  the  determinant,  and  produce  a 
"solution,"  which,  unfortunately,  is  totally  wrong. 

This  situation  is  a  very  real  problem.  Many  users  of 
statistical  program  packages  are  not  trained  in  the  theory 
underlying  the  computations  and  are  unaware  of  the  pitfalls 
of  the  leas t -squares  method.  Researchers  in  social  science 
are  now  trained  in  university  curricula  in  how  to  apply  the 
statistical  procedure  of  multiple  regression  analysis,  but 
they  are  not  expected  to  know  matrix  algebra  and  are 
generally  not  trained  in  it  as  part  of  their  introduction  to 
statistics.  As  a  professional  consulting  statistician,  the 
author  of  this  report  has  been  retained  on  more  than  one 
occasion  to  "explain"  absurd  results  obtained  because  of  the 
problem  of  linear  dependencies  in  regression  analysis 
applications.  In  technical  terms,  the  classical 
least -squares  method  is  not  "robust"  with  respect  to  the 
presence  of  linear  dependencies  or  near-dependencies  (high 
correlations)  in  the  explanatory  variables. 

Statistical  theory  can  handle  the  presence  of  linear 
dependencies,  if  they  are  recognized.  In  such  cases,  the 
inverse  of  the  correlation  matrix  is  replaced  by  a 
"generalized  inverse"  or  "conditional  inverse."  Although 
this  theory  is  generally  taught  to  graduate  statistics 
majors,  however,  it  is  not  known  to  most  data  analysts. 
Moreover,  most  of  the  major  statistical  software  packages  do 
not  offer  this  capability. 

The  present  study  began  as  an  attempt  to  determine  the 
feasibility  of  finding  fast  algorithms  for  estimation, 
prediction,  and  control.  In  the  course  of  the  study,  an 
algorithm  was  developed  that  did  not  fail  catastrophically 
in  "difficult"  problems.  Upon  observing  this,  it  was 
decided  to  explore  the  performance  of  the  algorithm  in 
ill-conditioned  problems  containing  linear  dependencies. 
The  method  works  well  in  such  cases.  The  significance  of 
this  result  could  be  very  great  from  a  commercial  viewpoint, 
in  view  of  the  extensive  use  of  statistical  multiple 
regression  analysis.  Moreover,  virtually  all  multivariable 
statistical  analysis  procedures  involve  matrix  inversion. 
All  of  these  methods  are  subject  to  catastrophic  failure, 
and  are  candidates  for  application  of  a  method  that  does  not 
require  matrix  inversion. 


III.  PROJECT  APPROACH 


A.  Summary  of  Approach 

The  approach  proposed  for  this  study  consisted  of  four  major 
steps: 

1.  Development  of  Criteria  for  Comparing  Alternative 
Estimation  Algorithms 

2.  Generation  of  Test  Cases 

3.  Synthesis  of  Candidate  Algorithms 

4.  Comparison  of  the  Performance  of  Candidate 
Algorithms  to  the  Perfomance  of  the  Classical 
Least-Squares  Algorithm 

Each  of  these  steps  is  described  in  detail  in  the  following 
subsections . 

B .  Development  of  Criteria  for  Comparing  Alternative 
Estimation  Algorithms 

In  order  to  assess  the  performance  of  alternative  procedures 
for  estimation,  prediction  and  control,  it  is  necessary  to 
identify  a  number  of  quantitative,  measurable  descriptors  of 
performance.  Alternative  procedures  may  differ  in  a  number 
of  respects,  such  as  processing  speed  and  accuracy.  It  is 
desirable  to  identify  a  set  of  performance  measures,  or 
criteria,  which  afford  a  relatively  comprehensive 
description  of  algorithm  performance,  and  yet  is  not  overly 
redundant . 

It  is  noted  that  the  performance  measures  that  are 
appropriate  for  an  algorithm  may  vary,  depending  on  whether 
the  algorithm  is  intended  for  use  for  estimation,  or  for 
prediction,  or  for  control.  In  an  estimation  problem, 
attention  centers  on  estimating  the  model  parameters  (b 
I,  SIG,  SIGMA)  as  closely  as  possible.  In  a  predictiSn 
or  control  problem,  attention  centers  on  using  the  model  for 
predicting  a  new  value  of  y  corresponding  to  a  specified  set 
of  x-values.  How  close  the  parameter  estimates  are  to  the 
true  values  is  of  secondary  interest  in  this  situation.  (In 


a  "prediction"  problem,  the  x's  are  either  passively 
observed  or  actively  controlled;  in  a  "control"  problem,  the 
x's  are  actively  controlled.  The  problem  of  predicting  the 
state  of  the  economy  is  essentially  a  prediction  problem; 
the  problems  of  controlling  a  steel  production  process  or 
directing  a  "smart"  bomb  to  a  target  are  examples  of 
"control"  problems.)  The  intended  application  of  the  model 
--  estimation,  prediction  or  control  —  should  influence  the 
sample  design  for  the  collection  of  the  data  from  which  the 
model  is  to  be  estimated.  For  example,  if  a  model  is  to  be 
used  to  predict  how  the  dependent  variable  (y)  will  respond 
to  forced  changes  in  the  independent  variables  (x's),  then 
the  data  should  correspond  to  the  case  in  which  forced 
changes  are  made  in  the  x's.  A  model  developed  from 
passively-observed  x's  is  not  appropriate  for  predicting  how 
the  system  will  respond  if  forced  changes  are  made  in  the 
x's  (although  this  is  often  done,  with  di ssapoi nt i ng 
results  1 ) . 

After  consideration  of  the  various  uses  (estimation, 
prediction  and  control)  of  the  models  under  study,  and  of 
the  various  properties  of  algorithms  that  may  be  of  interest 
to  model  developers,  it  was  decided  that  the  following  seven 
concepts  characterized  the  algorithm  performance  in 
reasonably  comprehensive  fashion; 

1.  Ccanputer  running  speed 

2.  Computer  storage  requirements 

3.  Precison  of  the  parameter  estimates 

4.  Bias  of  the  parameter  estimates 

5.  Mean  squared  error  of  the  parameter  estimates 

6.  Precision  of  model-based  predictions 

7.  Numerical  stability  of  the  algorithm 

Having  decided  on  these  concepts  as  comprising  a  relatively 
comprehensive  characterization  of  the  performance  of  an 
algorithm,  it  remained  to  determine  quantitative  measures  of 
each  concept.  These  concepts  and  their  associated  measures 
are  described  in  the  paragraphs  that  follow.  Note  that, 
although  the  preceding  concepts  were  identified  as  part  of 
the  Phase  I  report,  it  was  not  possible  with  the  Phase  I 
resources  to  numerically  determine  values  for  all  of  the 
measures.  That  numerical  determination  can  be  accomplished 


with  some  additional  programming,  and  should  be  done  as  one 
of  the  first  steps  in  the  Phase  II  study. 

1.  Computer  Running  Speed 


The  primary  motivation  for  the  proposed  study  was  to 
determine  the  feasibility  of  determining  estimation, 
prediction  and  control  algorithms  that  had  faster  running 
times  than  the  classical  least-squares  method.  The  measure 
of  speed  that  was  used  in  this  study  was  the  total  elapsed 
time  required  to  read  the  data  from  the  file  on  which  it  was 
stored,  compute  the  parameter  estimates,  and  print  out  the 
results.  The  time  required  to  "set  up"  the  run  (e.g., 
specify  the  data  file  name,  number  of  parameters,  number  of 
observations,  etc.)  was  not  included,  since  this  time 
consists  mainly  of  the  time  of  the  human  operator  to  enter 
data  through  the  microcomputer  keyboard,  and  does  not 
reflect  algorithm  performance. 

All  of  the  computer  programming  and  processing  on  this 
project  was  done  using  a  Radio  Shack  Model  II  microcomputer 
using  a  TRSDOS  Version  2.0  operating  system.  This 
microcomputer  utilizes  a  4  MHz  Z80  microprocessor,  and  has 
64  kilobytes  of  direct  access  memory.  The  programming  was 
done  in  FORTRAN  II,  using  a  Microsoft  (R)  FORTRAN  compiler 
to  produce  the  object  code.  Unfortunately,  the  available 
Microsoft  FORTRAN  did  not  permit  access  to  the  system  timer, 
and  so  the  timing  was  done  manually  (external  to  the 
program),  not  automatically  (internal  to  the  program).  The 
running  times  presented  in  this  report  are  hence 
approximate.  In  addition  to  algorithm  processing  time,  they 
include  the  time  required  to  print  the  results.  The  amount 
of  printed  output  for  the  various  methods  is  comparable. 
For  the  simpler  cases  examined,  the  algorithm  processing 
time  was  very  short  compared  to  the  print  time,  and  so  the 
total  measured  running  time  is  not  an  accurate  reflection  of 
the  processing  time.  For  the  more  difficult  cases,  the 
algorithm  processing  time  is  large  relative  to  the  print 
time,  and  so  the  measured  running  time  is  a  relatively  valid 
indicator  of  algorithm  processing  time. 

It  is  recognized  that  the  speed  measure  used  in  this  study 
is  very  crude,  particularly  for  problems  having  a  small 
number  of  explanatory  variables  (since  the  processing  time 
for  these  problems  is  small  compared  to  the  data  access  and 
printout  time).  In  Phase  II,  we  propose  to  develop  an 
assembly-language  timer  that  can  be  called  internal  to  the 
program,  so  that  accurate  measures  of  processing  time  can  be 
determined. 
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The  time  measurements  for  the  iterative  algorithm  were 
biased  high  in  many  cases,  because  the  algorithm  was  forced 
to  make  at  least  ten  iterations  even  if  convergence  had 
already  occurred.  In  Phase  II,  a  test  for  convergence 
should  be  developed,  so  that  the  algorithm  stops  when 
convergence  occurs. 
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2.  Computer  Storage  Requirements 

The  amount  of  di  rect -access  computer  memory  required  to 
implement  an  algorithm  is  a  concern,  since  it  determines  how 
"large"  a  problem  can  be  handled  with  available  memory,  or 
how  much  memory  is  required  to  handle  a  problem  of  a  given 
size.  The  size  of  an  estimation  problem  is  determined 
primarily  by  two  factors  —  the  number  of  observations  and 
the  number  of  explanatory  variables  (we  are  speaking  here 
only  the  univariate  case,  in  which  there  is  but  a  single 
dependent  variable).  The  classical  least-squares  method  has 
an  advantage  in  that  the  observations  may  be  read  and 
processed  one-at-a-time ,  and  do  not  need  to  be  stored 
simultaneously  in  memory.  The  particular  "fast  algorithm" 
that  was  considered  in  this  study  required  that  all  of  the 
data  be  stored  in  memory. 

Although  the  required  amount  of  direct-access  memory 
required  depends  on  the  problem  size,  the  computer  programs 
developed  in  this  project  did  not  dynamically  adjust  the 
memory  requirement  to  the  size  of  the  problem.  Instead,  the 
"dimensions"  of  the  program  variables  were  set  to  allow  for 
storage  of  all  observations  and  variables,  corresponding  to 
the  largest  problem  analyzed  ---  100  observations  and  ten 
independent  variables  (and  a  single  dependent  variable). 
Under  these  conditions,  the  core  requirements  of  the 
classical  leas t -squares  algorithm  and  the  particular  "fast 
algorithm"  investigated  in  this  study  were  as  follows; 

Least-squares  algorithm:  20,984  bytes 

Fast  algorithm;  21,759  bytes, 

i.e.,  the  core  requirements  are  approximately  the  same. 

3.  Precision  of  the  Parameter  Estimates 

In  an  estimation  problem,  it  is  desired  to  determine 
("estimate")  the  values  of  the  model  parameters  as  "closely" 
or  "accurately"  as  possible.  The  model  parameters  are  b  , 
I  ,  SIGMA,  and  SIG.  With  regard  to  estimation,  howeve?, 
we  are  generally  interested  only  in  b  b,  and  SIG,  not 
in  E  or  SIGMA.  The  reason  for  this  limitatfon  is  that  the 
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parameters  ^  and  SIGMA  are  not  generally  of  concern  in  an 
estimation  problem  once  the  x's  are  known.  The  usual 
objective  in  a  data  analysis  is  to  estimate  the  values  of 

,  b,  and  SIG,  or  to  predict  the  value  of  a  new  y  given 
specrfied  values  of  the  x*s.  While  the  fact  that  the  x's 
are  random  variables  can  affect  some  analysis  procedures 
(e.g.,  hypothesis  testing),  the  least-squares  estimates  are 
the  same  whether  the  x's  are  fixed  numbers  or  random 
variables.  For  this  reason,  we  shall  restrict  attention 
only  to  measures  of  precison  of  the  estimates  of  b  b, 
and  SIGMA.  This  restriction  applies  also  to  the  next  two 
performance  concepts  considered  (bias  and  accuracy). 

Accuracy  is  usually  reflected  in  two  concepts  —  precision 
and  bias.  The  "precision"  (or  reliability)  of  an  estimator 
(i.e.,  an  estimation  formula  or  algorithm)  refers  to  the 
degree  of  reproducibility,  or  variation,  of  the  estimates, 
if  repeated  data  samples  were  selected  and  the  estimator 
used  to  determine  the  estimate  value  for  each  sample.  The 
usual  measure  of  precision  is  the  standard  deviation  (square 
root  of  the  variance,  or  second  central  moment).  The  "bias" 
of  a  parameter  estimator  is  the  amount  of  systematic  error 
in  the  estimate,  measured  as  the  difference  between  the 
average  value  obtained  for  the  parameter  estimate  in 
repeated  sampling  and  the  true  value  of  the  parameter. 
(Note:  in  the  preceding  discussion,  we  have  used  the 
distinction  that  an  "estimate"  is  a  numerical  value  that 
represents  our  guess  as  to  the  value  of  the  parameter, 
whereas  the  terra  "estiraator"  refers  to  the  formula  or 
algorithm  for  producing  that  numerical  value.  This 
distinction  in  usage  of  these  terms  is  not  strict,  either  in 
the  field  of  statistics  or  in  this  report.) 

In  the  present  study,  it  was  possible  to  estimate  the 
precision  of  the  parameter  estimates  for  the  classical 
leas t -squares  method,  but  project  resources  dis  not  permit 
the  estimation  of  precision  of  the  parameter  estimates  for 
the  fast  algorithm  studied.  Closed-form  mathematical 
formulas  are  available  for  estimating  the  precision  of  the 
leas t -squares  estimates,  but  such  is  not  the  case  for 
estimating  the  precision  of  the  fast  algorithm  estimates. 
Instead,  the  precision  has  to  be  measured  empirically,  by 
selecting  many  independent  data  samples  and  computing  the 
standard  deviation  of  the  parameter  estimates  over  these 
samples.  Although  it  was  not  possible  to  implement  this 
procedure  in  Phase  I  (because  of  resource  limitations),  the 
estimated  standard  deviation  of  the  parameter  estimates  is 
considered  to  be  an  important  measure  of  algorithm 
performance,  and  this  procedure  should  be  implemented  in  the 
Phase  II  effort. 


While  the  precision  and  bias  of  the  parameter  estimates  are 
of  high  concern  in  estimation  problems,  they  are  of 
secondary  conern  in  prediction  and  control  problems.  In  the 
latter  types  of  problems,  it  is  the  accuracy  of  the 
prediction  that  is  of  primary  concern.  It  is  possible  to 
have  a  model  in  which  the  parameter  estimates  are  not  very 
accurate,  and  yet  the  accuracy  of  the  predictions  based  on 
that  model  is  comparable  to  those  obtained  from  a  model  in 
which  the  parameter  estimates  are  substantially  more 
accurate.  (This  may  be  the  case,  for  example,  if  the 
explanatory  variables  are  highly  correlated.) 

The  precision  of  most  statistical  estimates  increases  as  the 
sample  size  (number  of  data  observations)  increases,  usually 
by  the  factor  1/ ^  (for  the  standard  deviation). 

4.  Bias  of  Parameter  Estimates 

The  bias  of  an  estimate  is  the  expected  value  (over  repeated 
data  samples)  of  the  difference  between  the  expected  value 
of  the  parameter  estimate  and  the  true  value  of  the 
parameter.  The  bias  of  an  estimator  is  of  greater  concern 
in  estimation  probalems  than  in  prediction  and  control 
problems . 

As  was  the  case  in  the  measurement  of  precision.  Phase  I 
project  resources  did  not  permit  the  generation  an  analysis 
of  a  large  number  of  data  samples  (corresponding  to  the  same 
model  parameters)  to  estimate  the  bias. 

The  bias  of  an  estimator  may  or  may  not  decrease  as  the 
sample  size  increases. 


5.  Mean  Squared  Error 


The  mean  squared  error  is  a  measure  of  accuracy  —  i.e.,  it 
is  a  "combined"  measure  of  precision  and  bias.  The 
definition  of  the  mean  squared  error  of  a  parameter 
estimator  is: 


Mean  Squared  Error  =  variance  +  bias2 


or 

E(p  -  p)2  =  E(p  -  E(p))2  +  (E(p)  -  p)2, 


where  p  is  the  parameter  value  and  p  is  the  estimate,  i.e., 
the  mean  squared  error  is  the  expected  value  of  the  square 


of  the  difference  between  the  estimated  parameter  value  and 
the  true  parameter  value,  over  repeated  data  samples. 

Once  more.  Phase  I  resources  did  not  permit  the  numerical 
determination  of  the  mean  squared  error,  but  this  can  be 
done  in  Phase  II. 

6.  Precision  of  Model-Based  Prediction 

The  three  preceding  measures  are  concerned  wtih  measurement 
of  the  "closeness"  of  the  parameter  estimates  to  the  true 
values,  or  to  the  amount  of  variability  in  the  parameter 
estimates  in  repeated  data  samples.  For  prediction 
problems  ,  the  primary  concern  is  how  close  the  model-based 
predictions  (of  y,  the  dependent  variable)  corresponding  to 
a  specified  set  of  x-values  will  be  to  a  newly-sampled  y 
corresponding  to  those  x-values.  An  interesting  fact  is 
that  there  can  be  fairly  substantial  errors  in  the 
estimation  of  the  model  parameters,  and  yet  predictions 
based  on  the  (erroneous)  model  may  be  almost  as  good  as 
those  based  on  a  much-more-nearly-correct  model.  In  some 
applications  (e.g.,  econometric  modelling),  attention 
centers  very  much  on  estimation  of  the  model  parameters.  In 
other  applications,  the  parameter  values  are  of  incidental 
interest  —  all  that  matters  is  the  error  of  prediction. 

A  standard  indicator  of  the  prediction  error  is  the 
estimated  variance  or  standard  deviation  of  the  model 
"residuals,"  or  error  terms  (differences  between  the 
observed  y-values  and  those  predicted  ty  the  model).  This 
is  not  the  same  as  the  standard  deviation  of  the  prediction 
error  for  a  particular  set  of  x's,  which  depends  also  on  the 
specified  values  of  the  x's.  The  standard  deviation  of  the 
prediction  error  is  proportional  to  the  standard  devaition 
of  the  residuals,  however,  and  so  the  latter  is  a  good 
indicator  of  the  predictive  ability  of  the  model. 

Another  indicator  of  the  predictive  power  of  a  model  is  the 
reduction  in  variance  between  predictions  based  on  the 
trivial  model  that  predicts  that  each  new  y  will  equal  the 
mean  (of  the  y's),  and  the  variance  of  the  predictions  based 
on  the  estimated  model.  (This  measure  is  valid  only  for 
application  of  the  model  to  predict  y-values  from  x-values 
that  were  produced  in  the  same  fashion  —  e.g.,  passively 
observed,  forcibly  changed  —  as  were  those  from  which  the 
model  parameters  were  derived.  Also,  its  use  assumes  that 
the  theoretical  variance  of  y  is  finite,  which  is  often  not 
the  case  for  time  series  data.)  The  standard  error  of  the 
residuals  is  best  estimated  from  a  sample  other  than  that 
from  which  the  model  parameters  were  derived,  but  as  long  as 
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the  number  of  parameters  is  very  small  relative  to  the 
number  of  observations,  it  is  common  practice  to  use  the 
same  data  set  for  both  purposes. 

The  reduction  in  variance  for  predictions  based  on  the  "mean 
model"  compared  to  predictions  based  on  the  estimated  model 
is  called  the  "coefficient  of  determination."  It  can  be 
defined  as: 

CD  =  1  -  (variance  of  predictions  using  "mean  model") 
/(variance  of  predictions  using  estimated  model) 

The  coefficient  of  determination  was  determined  both  for  the 
classical  least -squares  model  and  the  fast  algorithm 
algorithm  studied  in  this  project. 

6 .  Numerical  Stability  of  the  Algorithm 

The  three  preceding  measures  of  algorithm  performance  are 
appropriate  in  most  cases.  In  some  situations,  however,  an 
algorithm  may  fail  to  operate  as  intended  because  of 
computer  roundoff  errors  or  because  of  an  intrinsic  wea)cness 
in  the  algorithm,  such  as  a  failure  to  converge  to  an  answer 
close  to  the  desired  answer,  or  a  failure  to  converge  at 
all.  The  length  of  time  required  for  convergence  to  a 
desired  answer  is  considered  under  performance  measure  1 
(computer  processing  speed),  if  the  process  converges.  (An 
additional  measure  of  performance  that  is  of  interest  in 
studying  the  performance  of  an  iterative  algorithm  is  the 
"rate"  of  convergence,  or  the  number  of  iterations  required 
for  convergence.) 

Some  of  the  test  cases  examined  in  the  project  present 
difficulties,  for  both  the  classical  least-squares  and  the 
fast  algorithm.  In  some  such  cases,  the  algorithm  may  fail 
catastrophically,  i.e.,  produce  totally  wrong  results.  In 
other  such  cases,  the  algorithm  may  produce  an  answer  (i.e., 
set  of  parameter  estimates)  that  is  not  very  close  to  the 
correct  answer  (i.e.,  the  true  parameter  values).  Those 
cases  are  noted,  with  an  indication  of  the  nature  of  the 
failure.  For  example,  the  classical  least-squares  method 
may  fail  because  of  an  inability  to  invert  a  near-singular 
matrix.  Or,  a  "fast"  algorithm  may  converge  very  slowly. 

2.  Generation  of  Test  Cases 


The  performance  of  an  estimation  algorithm  may  vary, 
depending  on  the  nature  of  the  data  set  to  which  it  is 
applied.  We  proposed  to  develop  a  set  of  test  cases  to 
which  candidate  algorithms  could  be  applied,  and  to  measure 
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their  performance  relative  to  each  test  case.  Although 
there  is  an  infinite  variety  of  test  cases  that  might  be 
considered,  it  was  possible  in  the  present  project  to 
generate  and  analyze  only  a  few.  It  was  decided  to  examine 
sixteen  cases  in  all.  The  nature  of  these  test  cases  was 
described  in  Section  I  of  this  report.  The  test  cases 
differ  in  terms  of  the  number  of  variables  included  in  the 
model,  and  in  terms  of  the  complexity  of  the  model,  as 
reflected  in  the  covariance  matrix  of  the  explanatory 
variables  (x*s). 

The  general  model  considered  in  this  study  was  of  the 
following  form: 


®  ^o  2j ®  j  j=l,2,...,n 

or 

Z  =  ^q1  +  X'b  +  e 

where 

=  dependent  variable 

*  ^^1 »y2 » • • * '^n ^  ®  vector  of  all 
observed  y 's 

ij  *  =  (Xj^j  ,X2j  , . .  .  ,Xjj,j  )  =  vector  of  inde¬ 
pendent  variables  corresponding  to  the 
j-th  observation 

X  ”  1 ' * • * '— n ^  “  data  matrix 

®'  =  ( ,02 , . . . ,  )  =  vector  of  model  error 

terms  corresponding  to  all  observations 

=  y-intercept 

“  ^^l»t*2' •  •  • “  vector  of  regression 
coefficients 

var(Xj)  =  z  siGMA^ 

var(e)  =  I  SIg2 


®  (1,1,. ..,1)  =  n-component  vector  of  all  I's 

where  is  an  mxm  covariance  matrix  and  I  is  an  nxn  identity 
matrix.  The  various  test  cases  correspond  to  four  different 
values  of  m  (m  =  1,  3,  6,  and  10),  various  values  of  SIGMA 
and  SIG,  and  various  covariance  matrices  I  . 

The  x's  are  random  variables  that  are  generated  as  follows. 
First,  m  independent  random  variables,  ,  f 9 , . . . , f^ , 
normally  distributed  with  mean  0  and  variance  bIGMA,  are 
sampled.  An  mxm  coefficient  matrix,  C,  is  then  specified: 


The  X ' s  are  computed  as 
X  =  C  ^  . 

The  covariance  matrix  of  x  is 

varx  =  CC  SIGMa2  =  z  sigma2  , 
where  z  =  CC ' . 

To  generate  y^  corresponding  to  a  specified  x  (say, 

/  '  ,  4  standardized  normal  deviate,  e-i,  is  sampled, 

multiplied  by  SIG,  and  the  result  added  to  x'b: 


=  ^^o  +  2Sj  'b  +  SIG  ej  . 

From  the  preceding,  it  is  seen  that  the  variance  matrix 
is  not  explicitly  specified.  Instead,  it  is  the  coefficient 
matrix  C  that  is  actually  specified.  The  matrix  i  has  the 
value  CC. 

The  coefficient  matrices  and  values  of  b  k,  sig,  and 
SIGMA  for  the  various  test  cases  are  specifTed  in  the 
computer  printouts  of  Appendix  B.  The  values  of  the 
corresponding  covariance  matrices  Z  are  also  specified  in 
those  printouts,  as  part  of  a  larger  matrix,  The 

covariance  matrix,  Z  specified  in  the  printouts  includes 
an  additional  row  and  column,  which  contains  the  variance 
and  covariances  of  y  with  each  x,  i.e.. 


^  p,  =  var  (y  ,x' ) 

=  var  ( b^]^  +  X I  b  +  e ,  x ' ) 
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The  normally  distributed  random  numbers  were  generated  by 
the  algorithm 

12 

f  =  SIGMA  z  u. 

i=l  ^ 


where  are  a  sequence  of  independent 

uniformly  di  st  ri  bute^x  random  numbers.  The  number  u-  ^as 
generated  by  the  multiplicative  congruential  metnod, 
specified  for  the  16-bit  Radio  Shack  Model  II  microprocessor 
by  the  algorithm; 

^^tent  =  IXold  (2^®  +  3)  =  259 

jjj  ^^tent  if  intent  ^  ^ 

^  1/^tent  +  (216-1)^1  =  IX^^„,  +  32767+1 

if  f\ent  <  0 

'^i  =  i^new  (2"i^)  =  I^new  ( •  30517578E-4) 

IX  1  j  =  ix 
old  -^^new 

The  multiplicative  congrential  method  for  generating 
pseudo-random  numbers  is  described  in  Reference  10. 


The  rationale  for  the  specification  of  the  test  cases  is  as 
follows.  For  the  case  of  a  single  independent  variable  (m  = 
1)  ,  the  four  test  cases  involved  four  different  values  of 
the  ratio  SIG/SIGMA;  .1,  .5,  1.0,  and  5.  The  case  with 

SIG/SIGMA  =  .1  is  easiest,  since  there  is  a  substantial 
amount  of  variation  in  x  and  a  small  model  error  term. 


f 


For  the  cases  with  m  greater 
SIG/SIGMA  was  set  equal  to  .1, 
variance  matrix  was  varied 
set  equal  to  the  identity 
orthogonal  (uncorrelated), 
coefficient  matrix  used  to 
was  specified  to  correspond 
correlation  among  the  x's. 
coefficient  matrix  was 
moderate-to-high  degree  of 


than  one,  the  value  of 
and  the  complexity  of  the 
For  the  first  subcase,  was 
matrix,  i.e.,  the  x's  were 
For  the  second  subcase,  the 
generate  the  x's  (and  hence  I  ) 
to  a  low-to-moderate  degree  of 
For  the  third  subcase,  the 
specified  to  produce  a 
correlation  among  the  x's.  In 


the  final  subcase,  a  linear  dependency  was  introduced  among 
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the  x's:  the  last  x  was  set  equal  to  the  sum  of  the  two 
preceding  x's. 


The  values  of  the  regression  coefficient.  b,  were  chosen 
to  make  the  estimation  difficult.  The  correlations  among 
the  x's  were  all  positive,  and  so  the  regression 
coefficients  were  specified  to  be  plus  ones  and  minus  ones. 
This  caused  convergence  of  the  iterative  algorithm  to  be 
slow,  because  each  time  an  adjustment  was  made  in  one 
direction  for  a  particular  coefficient,  a  reverse  adjustment 
would  be  required  on  the  next  iteration  for  coefficients 
having  the  opposite  sign. 

3.  Synthesis  of  a  Candidate  Algorithm 

In  order  to  demonstrate  the  feasibility  of  the  proposed 
concept  of  developing  a  fast  estimation  algorithm  that  could 
be  used  as  an  alternative  to  the  classical  least-squares 
algorithm,  it  was  proposed  to  synthesize  one  or  more 
alternative  algorithms,  and  to  examine  their  performance. 
The  class  of  algorithms  that  was  synthesized  in  this  project 
are  extensions  of  the  Wald  and  Bartlett  estimators,  extended 
to  the  case  of  more  than  one  independent  variable. 

The  algorithm  is  iterative.  At  the  k-th  iteration,  the 
residuals,  e.^'l,  from  the  preceding  iteration  are 
regressed  on  a^  particular  independent  variable,  x .  ^  using 
either  the  Wald  or  Bartlett  method: 


The  process  cycles  through  all  of  the  x's  in  order.  That 
is,  the  first  iteration  regresses  y  on  x^^ .  The  second 
iteration  regresses  the  residuals  from  the  first  regression 

on  X2.  The  third  iteration  regresses  the  residuals  from 
the  second  regression  on  x,,  and  so  on.  If  there  are  m 
independent  variables,  then  the  (m+l)-st  regression  begins 
over  again,  and  regresses  the  residuals  from  the  m-th 
regression  on  x^ ,  and  so  on. 

The  preceding  iterative  procedure  is  conjectured  to  produce 
consistent  parameter  estimates  of  the  model  parameters 


(i.e.,  estimates  that  converge  in  probability  to  the  true 
parameter  values  as  the  sample  size  increases). 
Determination  of  whether  this  is  true,  or  of  conditions 
under  which  it  is  true,  should  be  addressed  in  Phase  II. 

In  the  present  study,  several  modifications  of  the  preceding 
procedure  were  examined.  For  example,  a  "stepwise" 
procedure  was  considered,  in  which  the  regression  at  each 
iteration  was  performed  on  the  independent  variable,  x^, 
which  resulted  in  the  greatest  reduction  in  the  criterion: 

br  IQr 

where  IQ^  denotes  the  interquartile  range  of  Xj.,  and 
bj.  denotes  the  Wald  regression  coefficient  of  the 
residuals  at  that  iteration  on  x  Another  approach 
considered  was  to  regress  the  residuals  separately  on  all 
x's,  and  then  to  adjust  all  m  regression  coefficents  by  a 
specified  fraction  ("stepsize")  of  the  indicated  adjustment. 
No  modified  algorithm  was  found,  however,  that  performed 
faster  than  the  one  specified  above. 

Both  the  Wald  and  Bartlett  procedures  were  applied  to  the 
test  cases  for  which  m  =  1,  but  only  the  Wald  procedure  was 
applied  to  the  other  test  cases.  The  iterative  procedure 
described  above  will  be  referred  to  as  an  "iterative 
Wald-Bartlett "  estimation  method. 

E .  Comparison  of  the  Performance  of  the  Candidate  Algorithm 
to  the  Performance  of  the  Classical  Least-Squares  Algorithm 

Each  of  the  sixteen  test  cases  was  analyzed  using  both  the 
classical  1 e a s t - s q u a r e s  (CLS)  and  the  iterative 
Wald-Bartlett  (IWB)  algorithms.  The  performance  of  each 
case  was  determined,  using  the  processing  speed  and  the 
coefficient  of  determination  as  the  measures  of  performance, 
or  observing  whether  the  method  failed  catastrophically. 
The  closeness  of  the  residual  standard  error  to  the 
parameter  SIG  was  also  noted.  The  results  of  these 
simulations  are  described  in  the  next  section  of  this 
report . 
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IV.  SIMULATION  RESULTS 
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A.  Test  Cases  Involving  a  Single  Independent  Variable 

The  four  test  cases  with  m  =  1  independent  variable  were 
analyzed  using  both  the  Wald  and  Bartlett  estimates,  and  the 
classical  leas t -squares  estimator.  In  general,  all  cases 
ran  so  fast  that  speed  differences  could  not  be  reliably 
determined  among  the  methods.  All  methods  produced 
parameter  estimates  close  to  the  true  values,  and  all 
produced  similar  values  for  the  residual  standard  error  and 
the  coefficient  of  determination.  A  summary  of  the  results 
of  the  four  cases  with  m  =  1,  and  all  of  the  other  test 
cases  as  well,  is  presented  in  Figure  2. 

B.  Test  Cases  Involving  Three  Independent  Variables 
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For  the  test  cases  with  m  =  3  independent  variables,  the 
data  were  analyzed  with  both  the  iterative  Wald-Bartlett 
(IWB)  and  classical  least -squares  (CLS)  methods.  The 
results  are  presented  in  Figure  2.  Both  the  CLS  and  lYB 
methods  appear  to  be  of  comparable  speed,  in  cases  in  which 
the  CLS  method  does  not  fail.  The  CLS  method  fails  if  the 
correlations  among  the  x's  are  high,  or  if  there  is  a  linear 
dependency  among  the  x's.  The  IWB  method  succeeds  in  these 
cases,  but  does  not  converge  very  fast.  The  slow 
convergence  is  probably  due  to  the  fact  that  the  regression 
coefficients  were  purposely  specified  to  cause  slow 
convergence . 

C.  Test  Cases  Involving  Six  Explanatory  Variables 

The  results  for  the  cases  involving  six  explanatory 
variables  are  as  follows.  For  cases  in  which  the  CLS  method 
does  not  fail,  it  is  somewhat  faster  than  the  lYB  method, 
for  the  difficult  estimation  problems  represented  by  the 
test  cases  .  For  cases  in  which  the  correlations  among  the 
x's  is  high,  the  CLS  method  fails.  In  these  cases,  the  IWB 
method  is  slow  to  converge.  Once  again,  the  slow 
convergence  is  probably  due  to  the  "pathological" 
specification  of  the  regression  coefficients. 

D.  Test  Cases  Involving  Ten  Independent  Variables 
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Figure  2.  Algorithm  Performance  on  Sixteen  Test  Cases 


Model  Parameters 

Case  SIG 

No.  m  Corr  SIGMA  SIG  CD 


Performance  Measures 


3  Low 


3  High 


3  LDEP 


6  0 


6  Low 


6  High 


6  LDEP 


Time 

(sec)  RSE 


9901 

CLS 

40 

1.12 

WALD 

35 

1.12 

BART 

35 

1.12 

8000 

CLS 

40 

1.12 

WALD 

35 

1.12 

BART 

35 

1.12 

5000 

CLS 

40 

1.12 

WALD 

35 

1.12 

BART 

35 

1.12 

0385 

CLS 

40 

1.12 

WALD 

35 

1.12 

BART 

35 

1.12 

9967 

CLS 

65 

1.15 

IWB 

70 

1.15 

9967 

CLS 

65 

1.15 

IWB 

55 

1.16 

9955 

CLS 

65 

6.20 

IWB 

200 

1.20 

9975 

CLS 

65 

1.52 

IWB 

70 

1.15 

9983 

CLS 

105 

.99 

IWB 

130 

1.01 

9985 

CLS 

105 

.99 

IWB 

155 

1.15 

9989 

CLS 

105 

107. 

IWB 

840 

1.13 

9986 

CLS 

105 

8.68 

IWB 

70 

1.00 

CD  Fa 
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Model  Parameters 


Performance  Measures 


Case 

NO. 

m 

Corr 

SIG 

SIGMA 

SIG 

CD 

Est 

Meth 

Time 

(sec) 

RSE 

CD 

Fail 

13 

10 

0 

.1 

1.0 

.9990 

CLS 

180 

1.10 

.9989 

IWB 

200 

1.28 

.9988 

14 

10 

Low 

.1 

1.0 

.9988 

CLS 

180 

1.10 

.9988 

IWB 

300 

1.17 

.9984 

15 

10 

High 

.1 

1.0 

.9997 

CLS 

180 

610. 

— 

Fail 

IWB 

1800 

2.03 

.9991 

16 

10 

LDEP 

.1 

1.0 

.9991 

CLS 

— 

_ 

Fail 

IWB 

185 

1.13 

.9988 

Legend ; 

m;  number  of  explanatory  variables  (x’s) 

Corr:  degree  of  correlation  in  x's 

CD;  coefficient  of  determination 

Est  Meth;  CLS;  classical  least-squares  method 
WALD;  Wald's  method 
BART;  Bartlett’s  method 
IWB;  iterative  Wald-Bartlett  method 

Time;  time  to  read  data,  process  data,  and  print  results 

RSE;  residual  standard  error 

Fail;  the  CLS  method  failed  to  invert  the  correlation  matrix, 
or  the  correlation  matrix  was  singular,  and  the  solution 
was  incorrect 
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The  results  for  the  test  cases  involving  ten  independent 
variables  were  as  follows.  For  cases  in  which  the  CLS 
method  succeeded,  it  is  faster  than  the  IWB  method  for  the 
test  cases  studied.  For  the  cases  in  which  the  correlation 
among  the  x's  is  high,  the  CLS  method  fails,  and  the  lYB 
method  is  slow.  In  the  worst  case  (case  15),  the  process 
was  terminated  after  1800  seconds.  At  that  point,  the  model 
had  reached  a  solution  close  to  the  correct  solution,  but 
not  correct  --  the  residual  standard  error  was  2.03, 
compared  to  the  true  value  of  1.0. 

In  summary,  the  lYB  method  appears  to  be  comparable  in  speed 
to  the  CLS  method  for  problems  with  a  small  number  of 
explanatory  variables  or  low  correlations  among  the  x's. 
For  difficult  problems  (with  high  correlations  among  the 
x's)  ,  the  CLS  method  fails.  The  lYB  method  converged  in  all 
of  the  difficult  cases  but  one. 


V.  CONCLUSIONS  AND  RECOMMENDATIONS 


A.  Conclusions 


This  study  has  demonstrated  the  feasibility  of  developing  an 
algorithm  for  estimating  the  parameters  of  a  linear 
statistical  model  and  making  predictions  based  on  the 
estimated  model,  that  is  comparable  in  speed  to  the 
classical  least-squares  method  for  problems  of  low  to 
moderate  difficulty,  and  is  definitely  more  robust,  in  the 
sense  that  it  is  less  subject  to  catastrophic  failure.  The 
feasibility  was  established  by  synthesizing  an  algorithm  — 
an  extension  of  the  estimation  procedures  of  Wald  and 
Bartlett  —  which  often  outperformed  the  classical  method. 

The  availability  of  a  fast,  robust  estimation  procedure 
would  be  beneficial  to  both  military  and  nonmilitary 
applications.  In  a  military  context,  there  is  a  growing 
need  for  faster  estimation  procedures  —  current  procedures 
cannot  accomplish  tracking  of  large  numbers  of  objects  in 
real-time,  or  accomplish  large-scale  sensor  exploitation  in 
real-time.  Also,  embedded-processor  estimation  algorithms 
are  non-i nteracti ve  (i.e.,  must  perform  without  human 
intervention),  and  are  potentially  subject  to  catastrophic 
failure  if  based  on  the  classical  least-squares  procedure, 
if  highly  correlated  data  are  entered  into  the  data  input 
stream. 

In  both  military  and  commercial/industrial  applications, 
there  is  a  requirement  for  "fail-safe"  estimation 
algorithms.  The  classical  least-squares  algorithm  that  is 
currently  in  use  is  not  fail-safe.  For  problems  involving 
moderate  or  large  numbers  of  variables,  computer  roundoff 
errors  can  ruin  the  estimates,  and  the  user  may  be  totally 
unaware  of  the  failure.  The  danger  of  this  occurrence  is 
particularly  strong  in  microcomputers  having  short  word 
lengths  (e.g.,  16  bit  microprocessors),  and  is  a  problem 
even  for  32-bit  machines.  Furthermore,  many  persons  using 
statistical  software  packages  are  not  aware  that  linear 
dependencies  in  the  variables  can  cause  the  complete  failure 
of  the  methods.  Once  again,  roundoff  errors  may  obscure  the 
problem,  so  that  the  user  has  no  reason  to  believe  that  the 
estimation  algorithm  failed. 


While  the  present  project  has  demonstrated  the  feasibility 
of  developing  fast,  robust  algorithms,  it  has  not 
accomplished  the  ultimate  goal  of  developing  such  an 
algorithm.  The  algorithm  that  was  sythesized  in  this 
project  is  not  considered  to  be  a  final  solution  to  this 
problem  —  it  does  not  outperform  the  classical 
least-squares  method  in  every  case,  and  its  theoretical 
properties  (e.g.,  convergence,  consistency  of  estimates)  are 
unknown.  Continuation  of  the  effort  to  develop  improved 
(fast,  robust)  estimators  will  require  substantial 
additional  effort.  The  present  study  suggests,  however, 
that  the  potential  for  success  of  such  an  effort  is  high. 

B.  Recommendations 

It  is  recommended  that  a  Phase  II  study  be  conducted, 
oriented  toward  the  goal  of  developing  improved  algorithms 
for  estimation,  prediction  and  control.  It  is  further 
recommended  that  the  Phase  II  effort  should  address  the 
following  tasks. 

1.  Extension  of  the  results  of  the  Phase  I  study,  to 
include  analysis  of  a  wider  range  of  test  cases, 
and  measurement  of  the  full  set  of  performance 
measures  identified  in  this  study. 

2.  Development  of  a  broader  class  of  algorithms, 
additional  to  the  iterative  Wald-Bartlett  method 
developed  in  this  Phase  I  effort, 

3.  Extension  of  the  algorithm  to  consider  estimation 
problems  additional  to  the  multiple  linear 
regression  problem  considered  in  the  Phase  I  effort. 
Consideration  should  be  given  to  developing  fast, 
robust  methods  for  the  full  range  of  problems 
currently  addressed  by  least-squares  methods, 

such  as  multivariate  analysis  of  variance  and 
time  series  analysis  procedures  (currently  done 
by  Box-Jenkins,  Kalman  filter,  and  state-space 
methods ) . 

4.  Analyze  the  theoretical  numerical  and  statistical 
properties  of  candidate  algorithms,  such  as 
convergence  conditions  and  consistency. 

If  successful,  it  is  expected  that  there  would  be  a 
substantial  military  and  non-military  demand  for  a 
statistical  estimation  computer  program  package  based  on  the 
improved  methods.  With  respect  to  military  applications, 
such  methods  offer  the  potential  for  fast,  fail-safe 


processing  of  tracking  and  sensor  exploitation  data.  With 
respect  to  non-military  applications,  the  methods  are  much 
more  "user-friendly"  than  the  least-squares  method,  in  that 
the  user  would  be  protected  from  catastrophic  failures,  and 
would  not  need  to  understand  matrix  algebra  concepts  such  as 
linear  dependencies,  singularities,  and  numerical  stability 
problems  in  matrix  inversion,  to  be  assured  of  successful 
application  of  the  procedures.  In  view  of  the  large  number 
of  persons  involved  in  data  analysis,  and  the  growing  use  of 
microcomputers,  the  development  of  such  methods  is 
considered  to  be  a  very  significant  contribution  to  the 
field  of  data  analysis. 

The  determination  that  the  iterative  Wald-Bartlett  method 
avoided  the  catastrophic  failure  problem  of  the  classical 
least-squares  method  was  serendipi tously  discovered  during 
the  course  of  the  Phase  I  investigation.  Since  this 
property  of  the  algorithm  is  judged  to  be  probably  more 
important  than  speed  in  many  applications,  it  is  recommended 
that  the  title  of  the  Phase  II  study  be  changed  from  "Fast 
Algorithms  for  Estimation,  Prediction  and  Control,"  to 
"Improved  Algorithms  for  Estimation,  Prediction  and 
Control . " 

Reduction  of  the  danger  of  obtaining  wrong  answers  from 
regression  analyses  represents  an  area  of  potentially  great 
benefit,  to  a  wide  class  of  data  analysts.  The  original 
concept  of  this  study  was  to  develop  fast  algorithms, 
primarily  for  real-time  applications  such  as  tracking, 
sensor  exploitation,  or  industrial  process  control.  Those 
applications,  while  important,  concern  relatively  few  data 
analysts.  The  discovery  of  the  robustness  of  the  iterative 
Wald-Bartlett  algorithm,  however,  could  have  substantial 
impact  for  a  wide  class  of  data  analysts.  For  example,  a 
typical  logistics  application  involves  the  determination  of 
parametric  cost  estimating  relationships.  These 
relationships  are  estimated  by  linear  regression  analysis. 
Since  the  models  developed  are  empirical  in  nature,  they 
involve  the  analysis  of  a  large  number  of  cost-related 
variables.  The  presence  of  a  linear  dependency  in  the  data, 
or  the  occurrence  of  a  roundof f-error-caused  matrix 
inversion  failure  caused  by  high  correlation  among  some  of 
the  explanatory  variables,  could  produce  incorrect  results. 
The  availability  of  "fail-safe"  algorithms  would  benefit 
this  and  many  other  similar  applications.  In  view  of  the 
substantial  amount  of  funds  expended  by  the  Office  of  Naval 
Research  on  parametric  cost  analysis  and  other  data 
analysis,  the  benefits  of  improved  estimation  methods  would 
be  substantial. 
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Appendix  A 

Computer  Program  Source  Code  Listings 


g}RTRAN 

Created 

a^ioo 

^200 

(K)300 

K400 
500 
600 
00700 
^800 
P900 
OlOOO 
^100 
!}5-200 
ifl300 
01400 
s^500 

.meoo 

01700 

^”>1800 

:i9oo 

(J2000 

.(12100 

^^200 

W2300 

02400 

^2500 

W600 

02700 

12800 
p900 
3000 
93100 
^^3200 
S'3300 
03400 
p500 

V  n  ^  rv 


■80  Ver.  3.4  Copyright  1978,  79,  80  (C)  By  Microsoft  —  Bytes;  22872 
i  2 6 -Nov-80 


:-.3600 

03700 

M800 

v-^qnn 


^3900 

O’4000 

fL4100 

^4200 

;P-4300 

04400 


X4500 

^600 


04700 
<:^800 
2J:4900 
bsooo 
^100 
IS5200 
W300 
05400 
*.  v5500 
I;  :>l5600 
;•  05700 
I  ^800 


PROGRAM  SIMULA 

THIS  PROGRAM  GENERATES  THE  VECTOR  RANDOM  VARIABLE  X=C*F 
AND  THE  SCALAR  RANDOM  VARIABLE  Y=B0+B*X+SIG*E,  WHERE 
F  IS  AN  MXl  RANDOM  VECTOR,  X  IS  AN  MXl  VECTOR, 

C  IS  AN  MXM  MATRIX,  E(F)=0,  VAR(F )=I*SIGMA**2 , 

I  IS  AN  MXM  IDENTITY  MATRIX 

BO  IS  A  SCALAR  PARAMETER,  B  IS  AN  MXl  PARAMETER  VECTOR 
E  IS  A  SCALAR  N(0,1)  ERROR  TERM 

N  OBSERVATIONS  ARE  GENERATED,  AND  WRITTEN  TO  FILE  FNAMl 
INPUT  M,C(M,M) ,  SIGMA,  N,  B0,B(N),SIG 

OUTPUT  THE  N  OBSERVATIONS  (1  Y-COMPONENT  AND  M  X-COMPONENTS ) 
DIMENSION  C(ll,ll),  CT(11,11) ,VARX(11,11) ,  X(10),F(10) 

DIMENSION  SX(11),B(10) 

DOUBLE  PRECISION  FNAMl (2) 

DATA  LUN1,LUN2,LUN6/1,2,6/ 

LUNl; TERMINAL,  LUN2 : PRINTER,  LUN6:DATA  FILE  FOR  OUTPUT 
WRITE(LUN2,6) 

6  FORMAT { 15 HO  PROGRAM  SIMULA,/ 

142H  GENERATES  MULTIVARIABLE  UNIVARIATE  SAMPLE) 

WRITE (LUNl, 9) 

9  FORMAT (53H  ENTER  FILE  NAME,  1-16  ALPHA  CHARS,  LAST  CHAR  BLANK;  ) 
READ (LUNl, 8) FNAMl 
8  FORMAT (2A8) 

WRITE ( LUN  2,7) FNAMl 

7  FORMAT (12 HO FILE  NAME;  , 2A8 ) 

CALL  OPEN (LUN 6, FNAMl, 80) 

MXDIM=10 

5  CONTINUE 

WRITE (LUNl, 10) 

10  FORMAT (20 HO  IN PUT  DIMENSION,  M; ) 

READ (LUNl, 20 )M 
20  FORMAT(I2) 

IF(M.GT.MXDIM)GO  TO  5 
WRITE (LUN 2, 31 )M 

31  FORMAT ( 17 HODIMENSION  (M)  =  ,12) 

WRITE (LUN 2, 56) 

56  FORMAT (26H0 COEFFICIENT  MATRIX  (C)...) 

WRITE (LUNl, 42) 

42  FORMAT (51H0 INPUT  1  TO  USE  IDENTITY  COEFT  MATRIX,  0  OTHERWISE;) 
READ(LUN1,43)I0PT 

43  FORMAT (II) 

IF(IOPT.LE.O)GO  TO  44 
DO  46  1=1, M 

DO  47  J=1,M 

47  C(I,J)=0.0 
C(I,I)=1.0 

46  CONTINUE 

WRITE(LUN2,48) 

48  FORMAT ( 37 H  COEFFT  MATRIX  IS  IDENTITY  MATRIX) 

GO  TO  45 

44  CONTINUE 

DO  100  1=1, M 
WRITE(LUN1,40) ,I,M 

40  FORMAT (lOH  INPUT  ROW,I3,21H  OF  COEFFT  MATRIX,  C./ 

15H  M  =  ,I2,21H  VALUES,  8  AT  A  TIME;/) 


READ(LUN1,50) (C(I,J) ,J=1,M) 
WRITE (LUN 2, 57) (C(I,J) ,J=1,M) 
57  FORMAT(1X,8F10.4) 


'A' 
'•j  ■ 


.  >? 

|«  --O 
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i: 


I 


If 

'I 


1^ 
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V900 
06000 
^100 
&200 
^300 

i6400 

Isoo 

v>600 

06700 
tl-7^800 
(^5900 
07000 
80100 
^200 
(J7300 
07400 
C^^SOO 
tr-^eoo 

07700 
Hf7800 
C-*>900 
08000 
0-3100 
200 
300 
^  08400 
C  i":S500 
^600 
'•  08700 

!\  ^800 
B900 
.  09000 
Ji^'QlOO 
,  i.t9200 
f;  5^300 
b  0^400 
I  ^500 
^  u>9600 
09700 
0800 
;'>900 
10000 
^100 
:'')200 
10300 
J.0400 
‘<3500 
i€600 
10700 

V3800 

/:3900 
11000 
tAlOO 

jg.200 

n300 
t  ^1400 
^1500 
f.  5‘i600 
V  11700 

I  ^800 


50 

55 

100 

45 

60 

70 

81 

710 


711 


712 


727 


713 


714 


715 


90 


FORMAT(8F10.4) 

FORMAT(8F10.4) 

CONTINUE 
CONTINUE 
WRITE (LUNl, 60) 

FORMAT (14H  INPUT  SIGMA;  ) 

READ ( LUNl , 7 0 ) SI GMA 
FORMAT (FI 0.4) 

WRITE ( LUN  2,81) SIGMA 

FORMAT (49 HO  SIGMA  (SCALING  PARAMETER  FOR  INDEP  VARIABLES)  =  ,F10.4) 
WRITE (LUNl, 710) 

FORMAT (23H  INPUT  BO  (INTERCEPT):  ) 

READ(LUN1,70)B0 

WRITE(LUN2,711)B0 

FORMAT (18H0B0  (INTERCEPT)  =  ,F10.4) 

WRITE(LUN1,712)M 

FORMAT(27H  INPUT  PARAMETER  VECTOR,  B./ 

15H  M  =  ,I2,21H  VALUES,  8  AT  A  TIME:/) 

READ(LUN1,50) (B(I) ,I=1,M) 

WRITE(LUN2,727) 

FORMAT (24 HO  PARAMETER  VECTOR  (B)...) 

WRITE(LUN2,713) (B(I ) ,I=1,M) 

FORMAT(1X,8F10.4) 

WRITE(LUN1,714) 

FORMAT (12H  INPUT  SIG;  ) 

READ(LUN1,50)SIG 

WRITE(LUN2,715)SIG 

FORMAT (29H0 SIG  (MODEL  ERROR  STD  DEV)  =  ,F10.4) 

WRITE (LUN 1,90) 

NO  OF  OBSNS,  N,  TO  SIMULATE;  ) 


FORMAT ( 36 H  INPUT 
READ(LUN1,35)N 

35  FORMAT(I3) 

WRITE (LUN 2, 36) N 

36  FORMAT (14H0 NO  OF  OBSNS  =,I4) 

WRITE (LUNl, 91) 

91  FORMAT(33H  INPUT  NO  OF  RANDOM  NOS  TO  SKIP:  ) 
READ(LUN1,35)NSKIP 
WRITE{LUN2,92)NSKIP 

92  FORMAT (6H  SKIP  ,I3,31H  RANDOM  NOS  PRIOR  TO  SIMULATION) 
DO  95  I=1,NSKIP 

CALL  RANDN(TEMP) 

95  CONTINUE 

WRITE(LUN1,552) 

552  FORMAT(36H  INPUT  NO  OF  OBSERVATIONS  TO  PRINT;  ) 
READ(LUN1 ,35)NPRT 
COMPUTE  VARIANCE  MATRIX  OF  X 
CALL  ATRAN(C,M,M,CT) 

CALL  AB(C,CT,M,M,M,VARX) 

SIGS=SIGMA**2 

CALL  ASCAL(VARX,SIGS,M,M,VARX) 

COMPUTE  ROW  AND  COLUMN  CORRESPONDING  TO  Y 
CALL  AX(VARX,B,M,M,X) 

MP=M+1 

DO  325  1=1, M 
VARX(MP,I)=X(I ) 

VARX(I,MP)=X(I) 

325  CONTINUE 

CALL  CROSP(B,X,M,CP) 

VARX(MP,MP)=CP+SIG**2 

WRITE(LUN2,58) 


i-i 

p 


1. 


.900 
12000 
!100 
!200 
12300 

(2400 
2500 
2600 
;^700 
p800 
®2900 
13000 
■3100 
{93200 
13300 
>3400 
•^500 
*13600 
13700 
J3800 
X3900 
14000 
WlOO 

J^200 

"4300 

14400 

tt^soo 

1^600 

14700 

^800 

■1900 

15000 

^j5100 

3-^200 

15300 

^400 

^500 

i*5600 

15700 

>5800 

'■J6900 

16000 

^100 

>^6200 

'X6300 


58  FORMAT (27 HO VARIANCE  MATRIX  OF  X,  Y. . . ) 

DO  330  1=1, MP 
SX(I)=SQRT(VARX(I,I) ) 

WRITE(LUN2,57) (VARX(I,J) ,J=1,MP) 

330  CONTINUE 

VY=VARX(MP,MP) 

VRES=SIG**2 
CD=1.-VRES/VY 
WRITE (LUN 2, 640) CD 

640  FORMAT ( 32 HO COEFFICIENT  OF  DETERMINATION  = 
COMPUTE  CORRELATION  MATRIX 
DO  610  1=1, MP 
DO  610  J-=1,MP 

610  VARX(I,J)=VARX(I,J)/(SX(I)*SX(J) ) 

WRITE (LUN2, 54) 

54  FORMAT ( 30 HO CORRELATION  MATRIX  OF  X,  Y. . . ) 
DO  630  1=1, MP 

WRITE (LUN 2, 57) (VARX(I,J) ,J=1,MP) 

630  CONTINUE 

WRITE(LUN2,550) 

550  FORMAT ( 16H0 OBS ERVATIONS..  ./ 

124H  DEPENDENT  VARIABLE  LAST) 

WRITE(LUN2,553)NPRT 

553  FORMAT(8H  (FIRST  ,I3,15H  OBSNS  PRINTED)) 
KPRT=0 

DO  500  K=1,N 
DO  400  1=1, M 
CALL  RANDN(TEMP) 

F(I )=SIGMA*TEMP 
400  CONTINUE 

CALL  AX(C,F,M,M,X) 

COMPUTE  Y 
Y=B0 

DO  720  1=1, M 
720  Y=Y+B(I )*X(I ) 

CALL  RANDN(TEMP) 

Y=Y+SIG*TEMP 

WRITE(LUN6,55) (X(J) ,J=1,M) ,Y 
KPRT=KPRT+1 

IF(KPRT.GT.NPRT)GO  TO  501 
WRITE(LUN2,551)K, (X(J) ,J=1,M) ,Y 
501  CONTINUE 
500  CONTINUE 

551  FORMAT(9H  OBSN  NO  ,I4,2H;  , 2 (/IX, 8F10 , 4 ) ) 
END 


,F10.4) 


•fcogram  Unit  Length=08ED  (2285)  Bytes 
%ta  Area  Length=0B6C  (2924)  Bytes 


iCMbroutines  Referenced: 

•  ^ 


$13 

$11 

$10 

^RT 

$INIT 

$W2 

^D 

$R2 

OPEN 

Vm9 

$L1 

$T1 

PANDN 

ATRAN 

AB 

‘^.'SCAL 

$EA 

AX 

OROSP 

$AB 

$DB 

$SB 

$NB 

$MB 

Variables : 


1 

0001" 

CT 

01E5" 

VARX 

03C9  ” 

X 

05AD" 

P 

OSD  5" 

SX 

05FD" 

& 

0629" 

PNAMl 

0651" 

LUNl 

0661" 

■jN2 

0663" 

LUN6 

0665" 

MXDIM 

0  6FE" 

0719" 

T:000002 

071F" 

lOPT 

0790" 

I 

0796" 

J 

0798" 

T:000000 

I'hOlOOOO 

079C" 

SIGMA 

0846" 

BO 

08A9” 

BiG 

0948" 

N 

099D" 

NSKIP 

09DF" 

TEMP 

OAll" 

NPRT 

0A3E" 

SIGS 

0A4C" 

0A5C" 

T:020000 

0A5E" 

T:030000 

0A62" 

VY 

0A8A" 

VRES 

0A8E" 

CD 

0A92" 

TrOOOOOl 

OACl " 

KPRT 

0B3B" 

K 

1 

0B3D" 

Y 

0B45" 

Labels: 

V^L 

0006  ' 

6L 

0667" 

9L 

06AA" 

06E4" 

7L 

06E9" 

5L 

005E' 

lOL 

0700" 

20L 

071B" 

31L 

0720" 

,r*^L 

07  39" 

42L 

0758" 

43L 

0792" 

0188  ' 

46L 

0169  • 

47L 

0105' 

079E" 

45L 

0250' 

lOOL 

0240  ' 

40L 

07 C8  " 

50L 

0823" 

57  L 

0818" 

082B" 

60L 

0833" 

70L 

084A" 

irlL 

0851" 

710L 

088D" 

711L 

08  AD" 

712L 

08CA" 

727L 

090F" 

713L 

092C" 

^4L 

0937" 

715L 

094C" 

90L 

0974" 

Rl 

099F" 

36L 

09  A3" 

91L 

09B9  " 

92L 

09  El " 

95L 

0415* 

552L 

0A15" 

>J25L 

04F0  ' 

58L 

0A6A" 

330L 

05D1' 

S:40L 

0A96" 

610L 

064F' 

54L 

0AC5" 

630L 

071F' 

550L 

0AE8  " 

553L 

0B19" 

IpOL 

08C6  ' 

400L 

0788  ' 

720L 

07B6' 

^IL 

08C6  ' 

551L 

0B49" 

079A" 


0A60" 


16400 
:-:65oo 
■^•'66  00 
16700 
^800 
‘v6900 
^17000 
J,7100 
V^200 
V7300 
17400 

Svsoo 

>?7600 

17700 


SUBROUTINE  ATRAN (A,M,N,B) 

THIS  SUBROUTINE  COMPUTES  B=A-TRANSPOSE 
INPUT  MATRIX  A(M,N),M,N 
OUTPUT  MATRIX  B 

A  AND  B  MUST  BE  STORED  IN  DIFFERENT  LOCATIONS 
DIMENSION  A(M,N) ,B(N,M) 

DIMENSION  A{11,11) ,B(11,11) 

DO  100  1=1, M 
DO  110  J=1,N 
B(J,I)=A(I,J) 

CONTINUE 

CONTINUE 

RETURN 

END 


Program  Unit  Length=0090  (144)  Bytes 
^ata  Area  Length=0013  (19)  Bytes 


Subroutines  Referenced; 


ariables: 


0001" 

0007" 


1000000 

bels: 


M  0003" 

I  0009" 

GOOD"  TtOlOOOO 


N  0005" 

J  OOOB" 

OOOF"  T:020000 


0011" 


|00L 

17800 


1|7900 

^000 

IBIOO 

18200 


.^^300 

^^9400 


18500 

>3600 

>9700 

18800 

J.8900 

^9000 

iftlOO 

19200 

>:>300 

^9400 


007B'  llOL  0067' 

SUBROUTINE  AB( A, B,L ,M, N, C ) 

C  THIS  SUBROUTINE  COMPUTES  MATRIX  PRODUCT  C(L,N)=A(L,M)*B(M,N) 
C  INPUT  MATRIX  A(L,M) , MATRIX  B(M,N),L,M,N 
C  OUTPUT  MATRIX  C(L,N) 

C  DIMENSION  A(L,M) ,B(M,N) ,C{L,N) 

DIMENSION  A(ll,ll) ,B(11,11) ,C(11,11) 

DO  100  1=1, L 
DO  110  J=1,N 
TEMP=0.0 
DO  120  K=1,M 
TEMP=TEMP+A ( I , K ) * B ( K , J ) 

120  CONTINUE 

C(I,J)=TEMP 
110  CONTINUE 
100  CONTINUE 
RETURN 
END 


^'ogram  Unit  Length=00F5  (245)  Bytes 
Area  Length=001D  (29)  Bytes 

?jabroutines  Referenced: 

'I*. 

Vat  $li  $ti 

^M9  $MB  $AB 

S'ariables: 


^  0001" 

0007" 

I  OOOD" 

8,  0015" 

:-.1: 020000  OOlB 


B  0003" 

N  0009" 

J  OOOF" 

T:000000  0017" 


L  0005" 

C  OOOB" 

TEMP  0011" 

TiOlOOOO  0019 


Labels : 

p  \ 

i)0L  OODC  llOL  00C8'  120L  0085' 


=■.^500 
J:9600  C 
1?700  C 


?.?800 

£^900 

ibooo 

20100 

^50200 

b300 


20400 


C 

C 

C 

C 


SUBROUTINE  ASCAL ( A , SCAL ,M, N, B) 

THIS  SUBROUTINE  MULTIPLIES  MATRIX  A  TIMES  SCALAR  SCAL 
B=A*SCAL 

MATRICES  A  AND  B  MAY  BE  STORED  IN  THE  SAME  LOCATION 
INPUT  MATRIX  A(M,N),  SCALARS  SCAL,  M,N 
OUTPUT  MATRIX  B(N,M) 

DIMENSION  A(M,N) ,B(M,N) 

DIMENSION  A(ll,ll) ,B(11,11) 

DO  100  1=1, M 
DO  110  J=1,N 
B(I,J)=SCAL*A(I,J) 


«i 


■)600  110  CONTINUE 

20700  100  CONTINUE 

^800  RETURN 

^)900  END 


I 


ogram  Unit  Length=0081  (129)  Bytes 
ta  Area  Length=0015  (21)  Bytes 


Subroutines  Referenced: 

It 

$MB 


I 


riables : 


0001" 

0007" 

OOOD" 


$M9 

$T1 


SCAL  0003" 
B  0009" 
T:000000 


OOOF" 


$L1 


M  0005" 
I  OOOB" 
T:010000 


0011" 


T;020000 
'^vibels: 
JOOL 

Aooo 
21100 
*i;i200 
l:*i300 
21400 

*500 
600 
21700 
?.1800 
•:a9oo 
^2000 

KlOO 
200 
■?'2  300 
22400 
y2500 

feeoo 


0013" 


006C' 


llOL 


0058 


SUBROUTINE  AX (A,X,M,N,Y) 

C  THIS  SUBROUTINE  MULTIPLIES  MATRIX  A(MXN)  TIMES  VECTOR  X(NXl) 
C  I.E.,  COMPUTES  Y  =  A*X 
C  INPUT  MATRIX  A(M,N),  VECTOR  X{N) 

C  OUTPUT  MATRIX  Y(M) 

C  X  AND  Y  MUST  BE  IN  DIFFERENT  STORAGE  LOCATIONS 
C  DIMENSION  A(M,N) ,X(1) ,Y(1) 

DIMENSION  A(ll,ll) »X(1) ,Y(1) 

DO  100  1=1, M 
TEMP=0.0 
DO  110  J=1,N 
TEMP=TEMP+A(I,J) *X(J) 

110  CONTINUE 
Y(I)=TEMP 
100  CONTINUE 
RETURN 
END 


jicograin  Unit  Length=00B7  (183)  Bytes 
•■'ata  Area  Length=0017  (23)  Bytes 

Sjubroutines  Referenced: 


$L1 

$T1 

?M9 

$MB 

$AB 

’.1.  . 

•.2,ariables : 

^  0001" 

X 

0003" 

M  0005" 

W  0007" 

Y 

0009" 

I  OOOB" 

TEMP  OOOD" 

T^:  010000 

i^bels : 

J 

0015" 

0011" 

T:000000 

^OL  009E' 

llOL 

006D' 

0013" 


m 


m 
I 


300 
23400 

t500 
600 
23700 


SUBROUTINE  CROSP( X , Y ,M, CP ) 

C  THIS  SUBROUTINE  COMPUTES  THE  CROSSPRODUCT  OF  VECTORS  X  AND  Y 
C  INPUT  VECTORS  X(M),Y{M),  DIMENSION  M 
C  OUTPUT  CROSSPRODUCT  CP 
DIMENSION  X(l) ,Y(1) 

CP=0.0 

DO  100  1=1, M 
CP=CP+X(I)*Y(I) 

100  CONTINUE 
RETURN 
END 


Qrogram  Unit  Length=006B  (107)  Bytes 
C^ta  Area  Length=000F  (15)  Bytes 


C^ibroutines  Referenced: 

$AT 

$L1 

V 

$AB 

Variables ; 

0001" 

0007" 

Y 

0003" 

I 

0009" 

TjOlOOOO 

OOOD" 

i*-abels: 


$T1 


M  0005'' 

TrOOOOOO  OOOB" 


)0L 


0052 


23800 

SUBROUTINE  RANDN(V) 

28900 

C 

THIS  SUBROUTINE  GENERATES  STANDARD  i 

Sjooo 

C 

BY  SUMMING  12  UNIFORMLY  DISTRIBUTED 

S^lOO 

C 

OUTPUT  V  IS  THE  RANDOM  NUMBER 

24200 

A=0.0 

§300 

DO  50  1=1,12 

^400 

CALL  RANDU(Y) 

24500 

A=A+Y 

:-.4600 

50  CONTINUE 

':ji7oo 

V=A-6 . 

24800 

RETURN 

^900 

END 

DEVIATES 


S'rogram  Unit  Length=0055  (85)  Bytes 
^ata  Area  Length=000D  (13)  Bytes 

^broutii 


.nes  Referenced; 


f^l 


$T1 

$SB 


RAN  DU 


Variables : 


Y 

•/. 

vibels; 


0001” 

0009" 


000  3" 


0007" 


002D' 


25000  SUBROUTINE  RANDU(YFL) 

100  C  THIS  SUBROUTINE  GENERATES  UNIFORMLY  DISTRIBUTED  RANDOM  NUMBERS 
200  C  BY  THE  MULTIPLICATIVE  CONGRUENTIAL  ("RESIDUE")  METHOD 
300  C  OUTPUT  YFL  IS  THE  RANDOM  NUMBER 

K400  DATA  IX/32769/ 

500  IY=IX*259 

600  IF(IY)5,6,6 

25700  5  IY=IY+32767+l 

800  6  YFL=IY 

900  YFL=YFL*.30517578E-4 

26000  IX=IY 

100  RETURN 

200  END 

program  Unit  Length=004C  (76)  Bytes 
ta  Area  Length=0007  (7)  Bytes 

Subroutines  Referenced: 

SCA  STl 


kilkriables: 


YFL 

abels: 


0001" 


0003" 


0005" 


0019  ' 


6L 


0023' 


loRTRAN 

Created 

KlOO 
200 
300 
QO400 
1)500 

ileoo 

00700 

S800 
900 
01000 
W.100 
5-200 
01300 
01400 

feisoo 

Sieoo 

01700 

f«l800 

'^<■900 

02000 

^’lOO 

^200 

ff5300 

Q.2400 

tvJsOO 

^^600 

02700 

y2800 

V900 

03000 

0.3100 

:«200 

lf3300 

^400 

Usoo 

S'aeoo 

03700 

'/3800 

•:^900 

04000 

9^100 

^200 

3%300 

04400 

t500 
600 
04700 
111800 
iit900 
05000 

XlOO 
200 
300 
05400 
{;’6500 
Co600 
05700 
i6800 


-80  Ver.  3.4  Copyright  1978,  79, 
:  26-NOV-80 


By  Microsoft  —  Bytes:  22872 


PROGRAM  MREG 

PERFORMS  MULTIPLE  LINEAR  REGRESSION  ANALYSIS 

DIMENSION  XBAR(ll) ,STD{ 11 ) ,RX(121) ,R ( 66 ) , B( 11 ) , D( 11 ) , T( 10 ) 
DIMENSION  LL(IO) ,MM(10) ,SB(10) ,RY (10) ,ISAVE(11) 

DIMENSION  X(l) ,ANS(11) ,NAME(40) 

DIMENSION  RXI(10,10) ,RXJ (10,10) ,RPROD ( 10 , 10 ) 

DOUBLE  PRECISION  RX 
DOUBLE  PRECISION  FNAM1(2) 

DATA  LUN1,LUN2,LUN6/1,2,6/ 

LUNl : TERMINAL;  LUN2: PRINTER;  LUN6:DATA  FILE  FOR  INPUT 
WRITE(LUN2,80) 

80  FORMAT(13HOPROGRAM  MREG,/ 

136H  MULTIPLE  LINEAR  REGRESSION  ANALYSIS) 

WRITE (LUNl, 90) 

90  FORMAT(53H  ENTER  FILE  NAME,  1-16  ALPHA  CHARS,  LAST  CHAR  BLANK:  ) 
READ ( LUN 1 , 9  5 ) FNAMl 
95  FORMAT ( 2 A8) 

CALL  OPEN (LUN 6, FNAMl, 80) 

WRITE ( LUN  2,209) FNAMl 
>09  FORMAT ( 17 HODATA  FILE  NAME:  ,2A8) 

WRITE (LUN 1,201) 

>01  FORMAT(46H  ENTER  NO  OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR/ 

133H  NO  OF  INDEP  VARS  IN  REGRESSION:  ) 

READ ( LUNl , 18 ) NOBS , NVAR , IDEP , NIND 

18  FORMAT(16I5) 

WRITE (LUN 2, 16) 

16  FORMAT(40HONO  OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR/ 

149H  IN  REGRESSION,  NO  OP  INDEP  VARS  IN  REGRESSION...) 

10=0 

WRITE( LUN  2,13) NOBS , NVAR , IDEP , NIND 
13  FORMAT(1X,8I10) 

WRliE(LUNl,19) 

19  FORMAT(44H  ENTER  INDICES  OF  INDEP  VARS,  IN  ASC  ORDER:  /) 
READ(LUN1,18) ( ISAVE ( I ) , 1=1 ,NIND) 

WRITE(LUN2,215) 

>15  FORMAT(39H  INDICES  OF  INDEP  VARS  IN  REGRESSION...) 

WRITE(LUN2,13) (ISAVE(I) ,1=1, NIND) 

WRITE (LUNl, 221) 

>21  FORMAT(33H  ENTER  MAX  NO  OF  OBSNS  TO  PRINT:  ) 

READ(LUN1,222)NPRTM 
>22  FORMAT(I4) 

NPRTM=MI NO ( N  PRTM , NOBS ) 

NOPR=0 

WRITE(LUN1,29) 

29  FORMAT ( 45 H  ENTER  1  TO  PERFORM  REGRESSION,  0  TO  SUPPRESS/) 
READ(LUN1,203)IREG 
!0  3  FORMAT(Il) 

WRITE(LUN2,12) 

12  FORMAT(8HODATA. . . ) 

WRITE(LUN2,223)NPRTM 
>23  FORMAT(8H  (FIRST  ,I4,7H  OBSNS)) 

CALL  CORRE{NOBS,NVAR,IO,X,XBAR,STD,RX,R,D,B,T,NPRTM,NOPR) 
WRITE(LUN2,32) 

32  FORMAT (27 HO MEANS  FOR  ALL  VARIABLES...) 

WRITE (LUN 2, 20) (XBAR ( I ) , 1=1 ,NVAR ) 

WRITE (LUN 2, 34) 

34  FORMAT(41HOSTANDARD  DEVIATIONS  FOR  ALL  VARIABLES...) 
WRITE(LUN2,20) (STD{I),I=1,NVAR) 


ASC  ORDER: 


TO  PERFORM  REGRESSION,  0  TO  SUPPRESS/) 


p900 

06000 

^100 

K200 

06300 

K400 
500 
600 
06700 
^800 
*i£900 
07000 
^100 
^200 
07300 
Q.7400 
J,7  500 
^7600 
07700 
W800 
•>7900 
08000 
g^lOO 
^200 
(re3oo 

OB400 

*5^500 

(^^600 

08700 

|,U800 

#900 

09000 

<19100 

(V)200 

0^300 

09400 

•|500 

^600 

09700 

W800 

i^900 

10000 

U)100 

:c^200 

10  300 
10400 
00500 
Ifeooo 

10700 

•>3800 

^900 

liooo 

UlOO 

11.200 

11300 

11400 

&500 

S1600 

11700 

^800 


WRITE(LUN2,28) 

28  FORMAT ( 2 2 HO  CORRELATION  MATRIX. . . ) 

DO  130  I=1,NVAR 

Nl=I*(I-l)/2+l 

N2=N1+I-1 

130  WRITE(LUN2,20) (R(J) ,J=N1,N2) 

20  FORMAT(1X,8F10.4) 

IF(IREG)125,125,126 
126  CONTINUE 

CALL  ORDER ( N VAR , R , I DEP , NI ND , I SAVE , RX , RY ) 

DO  104  I=1,NIND 
K1=NIND*(I-1) 

DO  104  J=1,NIND 
K=K1+J 

104  RXI(I,J)=RX(K) 

CALL  MINV(RX,NIND,DD,LL,MM) 

WRITE(LUN2,213)DD 

213  FORMAT(15HODETERMINANT  =  ,E10.4) 

IF(DD)211,211,212 

211  WRITE(LUN2,214) 

214  FORMAT (49H0 ZERO  DETERMINANT,  REGRESSION  CANNOT  BE  PERFORMED) 

GO  TO  100 

212  CONTINUE 

DO  108  I=1,NIND 
K1=NIND*(I-1) 

DO  108  J=1,NIND 
K=K1+J 

108  RXJ(I,J)=RX(K) 

WRITE(LUN2,25) 

25  FORMAT (61 HO  INVERSE  OF  THE  PART  OF  THE  CORR  MATRIX  USED  IN  REGRESSI 
ION. . . ) 

DO  105  I=1,NIND 
N1=NIND*(I-1)+1 
N2=N1+I-1 

105  WRITE(LUN2,20 ) (RX(J) ,J=N1,N2) 

DO  107  I=1,NIND 

DO  107  J=1,I 
RP=0 

DO  106  K=1,NIND 

106  RP=RP+RXI(I,K)*RXJ(K,J) 

107  RPROD(I,J)=RP 
WRITE(LUN2,24) 

24  FORMAT ( 3 3H0 PRODUCT  OF  MATRIX  AND  INVERSE...,/ 

15 2H  SHOULD  BE  IDENTITY  MATRIX.  IF  NOT,  ROUNDOFF  ERRORS/ 

252H  HAVE  RUINED  SOLUTION...  DISREGARD  REST  OF  ANALYSIS.) 

DO  103  I=1,NIND 

Nl=l 

N2=I 

103  WRITE(LUN2,20 ) (RPROD(I,J) ,J=N1,N2) 

CALL  MULTR ( NOBS , NI ND , X BAR , STD , D , RX , RY , I SAVE , B , SB , T , ANS ) 

DO  102  I=1,NIND 
N1=NIND*(I-1)+1 
N2=N1+I-1 
JJ=0 

DO  102  J=N1,N2 
JJ=JJ+1 

102  RX(J)=RX(J)/(STD(I)*STD(JJ)*(NOBS-l) ) 

WRITE(LUN2,26) 

26  FORMAT(67HO INVERSE  OF  THE  PART  OF  THE  CROSS  PROD  MATRIX  USED  IN  RE 
IGRESSION. . . ) 


REGRESSION  =  ,110) 


REGRESSION. . . ) 


■L900  DO  101  1=1, KIND 

12000  N1=NIND*(I-1)+1 

MlOO  N2=N1+I--1 

K200  101  WRITE(LUN2,33)  (RX(J)  ,J=N1,N2) 

12300  33  FORMAT(1X,8(2X,E10.4) ) 

^400  WRITE(LUN2,35)IDEP 

KSOO  35  FORMAT (34H0  INDEX  OF  DEP  VAR  IN  REGRESSION  =  ,110) 

¥2600  WRITE(LUN2,36) 

12700  36  FORMAT(39H  INDICES  OF  INDEP  VARS  IN  REGRESSION...) 

t800  WRITE(LUN2,21)  (ISAVE(I )  ,I=1,NIND) 

900  21  FORMAT (IX, 8 110) 

13000  WRITE(LUN2,38) 

piOO  38  FORMAT(27H  REGRESSION  COEFFICIENTS...) 

^200  WRITE{LUN2,20)  (B  ( I )  ,  1=1  ,NIND) 

13300  WRITE{LUN2,40) 

.1^400  40  FORMAT(12H  T-VALUES...) 

mSOO  WRITE(LUN2,20)  (T(I )  ,1=1  ,NIND) 

13600  DO  37  I=1,NIND 

13700  37  T(I)=B(I)/T(I) 

^"^800  WRITE(LUN2,41) 

SB900  41  FORMAT(25H  STD  DEVS  OF  REG  COEFS. . . ) 

14000  WRITE(LUN2,20) (T{ I ) , 1=1 ,NIND) 

t4100  WRITE(LUN2,42)ANS(1) 

4200  42  FORMAT(12H  INTERCEPT  =,F10.4) 

4300  WRITE(LUN2,47)ANS(3) 

14400  47  FORMAT(27H  RESIDUAL  STANDARD  ERROR  =  ,F10.4) 

^500  WRITE(LUN2,43)ANS(2) 

i*4600  43  FORMAT(42H  SAMPLE  MULTIPLE  CORRELATION  COEFFICIENT  =,F10.4) 

14700  WRITE(LUN2,45)ANS(11) 

^800  45  FORMAT(38H  SAMPLE  COEFFICIENT  OF  DETERMINATION  =,F10.4) 

B900  WRITE(LUN2,48)ANS(4) 

15000  48  FORMAT! 50H  SUM  OP  SQUARES  ATTRIBUTABLE  TO  REGRESSION,  SSAR  =, 

?5100  1F14.4) 

5200  WRITE(LUN2,49)ANS(5) 

5300  49  FORMAT! 42H  DEGREES  OF  FREEDOM  ASSOCIATED  WITH  SSAR  =,F10.4) 

^400  WRITE(LUN2,50)ANS(6) 

1^500  50  FORMAT! 22H  MEAN  SQUARE  OF  SSAR  =,F14.4) 

15600  WRITE!LUN2,51)ANS!7) 

15700  51  FORMAT! 53H  SUM  OF  SQUARES  OF  DEVIATIONS  FROM  REGRESSION,  SSDR 

y5800  1F14.4) 

X5900  WRITE!LUN2,52)ANS!8) 

16000  52  FORMAT! 42H  DEGREES  OF  FREEDOM  ASSOCIATED  WITH  SSDR  =,F10.4) 

^100  WRITE!LUN2,53)ANS(9) 

|16200  53  FORMAT!22H  MEAN  SQUARE  OF  SSDR  =,F14.4) 

16300  WRITE!LUN2,54)ANS!10) 

;5400  54  FORMATdOH  F-VALUE  =,F14.4) 

^500  125  CONTINUE 

|P5600  100  CONTINUE 

16700  END 

^ogram  Unit  Length=094E  (2382)  Bytes 
Data  Area  Length=10D5  (4309)  Bytes 


ibroutines  Referenced: 


CORRE 

I^DER 


$13 

$INIT 

$R2 

$M9 

$L1 


$11 

$W2 

OPEN 

$D9 

$T1 


$CA 

MULTR 


$DB 


0001” 

023D” 

039D" 

03ED" 

0453" 

04D3" 

0983" 

0997" 

0A89" 

0B3E" 

0B9A" 

0C7B" 

0C96" 


yp  0D36" 

i^oooooi 


0DE9" 


STD 

002D" 

B 

0345" 

LL 

03C5" 

RY 

0415" 

ANS 

0457" 

RXJ 

0663" 

LUNl 

0993" 

NOBS 

0A85" 

NIND 

0A8B" 

T:000000 

IREG 

OBCF" 

J 

0C7D" 

TiOlOOOO 

T:020000 

0B40" 

0C98" 

0D3A" 


RX 

00  59" 

D 

0371" 

MM 

03D9" 

I  SAVE 

04  3D" 

NAME 

0483" 

RPROD 

07  F3" 

LUN2 

0995" 

NVAR 

0A87" 

10 

OAFB" 

NPRTM 

0B94" 

N1 

0C79" 

K1 

0C94" 

DD 

0C9A" 

JJ 

0DE7" 

i^^bels: 


0006' 

80L 

0999" 

90L 

09D4" 

OAOE" 

209L 

0A13" 

201L 

0A2D" 

0A93" 

16L 

0A99" 

13L 

0B03" 

OBOC" 

215L 

0B42" 

221L 

0B6E" 

222L 

0B96" 

29L 

0B9C" 

20  3L 

OBDl" 

yiZL 

0BD5" 

223L 

OBEl " 

32L 

OCIO" 

VL 

0C7F" 

34L 

0C30" 

28L 

0C5E" 

130L 

027A' 

125L 

093F* 

126L 

02D2' 

V)4L 

0303  ' 

213L 

0CA4" 

211L 

0381' 

5i2L 

0390  • 

214L 

OCBE  " 

lOOL 

093F' 

rbsL 

03B5’ 

25L 

0CF4" 

105L 

042F' 

^7L 

04F5  ' 

106L 

0497  ' 

24L 

0D3C" 

■!b3L 

055D' 

102L 

05F6’ 

26L 

ODED" 

xOlL 

068F' 

33L 

0E35" 

35L 

0E45" 

36L 

0E70" 

21L 

0E9C" 

38L 

0EA5" 

;-:oL 

0EC5" 

37L 

07C7  ' 

41L 

0ED6" 

f.'2L 

0EF4" 

47L 

OFOB" 

43L 

0F31" 

45L 

0F66" 

48L 

0F97" 

49L 

0FD4" 

1009" 

51L 

102A" 

52L 

106A" 

109F" 

54L 

10  CO" 

16800 

SUBROUTINE  DATA(M 

,D, NPRTM, NOPR) 

’^900 

DIMENSION  D(l) 

#7000 

LUN2=2 

17100 

LUN6=6 

;^2oo 

READ(LUN6,10) (D(I) ,I=1,M) 

^?7  300 

10 

FORMAT(8F10.4) 

17400 

NOPR=NOPR+l 

^7500 

IF(NOPR.GT.NPRTM)GO  TO  12 

Ip  600 

WRITE (LUN2, 11) (D(I) ,I=1,M) 

^700 

12 

CONTINUE 

y800 

11 

FORMAT{1X,8F10.4) 

57900 

RETURN 

^8000 

END 

Brogram 

Unit 

Length=00D5  (213) 

Bytes 

ita  Area  Length=0025  (37)  Bytes 
ibroutines  Referenced; 


iriables : 


labels: 

Loo 

18200 

?$300 

'•:34oo 

18500 

g^600 

|S700 

Rooo 

3jB900 

jfeooo 

«100 

19200 

*300 
400 
19500 
i'^600 
r^700 
13800 
^900 

pooo 

^100 

20200 


0001" 

0007" 

OOOD" 


0011" 


D  0003" 
LUN2  0009" 
T;000000 


12L  OODO' 


OOOF" 


NPRTM  0005" 
LUN6  OOOB" 
T:000002 


0019" 


OOIA" 


SUBROUTINE  ORDER (M, R , NDEP , K, I SAVE , RX, RY ) 
DIMENSION  R(l) ,ISAVE(1) ,RX(1) ^RY (1) 
DOUBLE  PRECISION  RX 
MM=0 

DO  130  J=1,K 
L2=ISAVE(J) 

IF(NDEP-L2)122,123,123 

122  L=NDEP+(L2*L2-L2)/2 
GO  TO  125 

123  L=L2+(NDEP*NDEP-NDEP)/2 
125  RY(J)=R(L) 

DO  130  1=1, K 
L1=ISAVE(I) 

IF(L1-L2)127,128,128 

127  L=Ll+(L2*L2-L2)/2 
GO  TO  129 

128  L=L2+(Ll*Ll-Ll)/2 

129  MM=MM+1 

130  RX(MM)=R(L) 

ISAVE(K+1)=NDEP 

RETURN 

END 


■program  Unit  Length=0lAC  (428)  Bytes 
Data  Area  Length=001D  (29)  Bytes 

subroutines  Referenced; 

§,AT  $M9 

gLl  $T1 

Variables; 


< 

0001" 

R 

0003" 

NDEP 

0005" 

K 

0007" 

I  SAVE 

0009" 

RX 

OOOB" 

OOOD" 

MM 

OOOF" 

J 

0011" 

K 

0013" 

T;000000 

0015"  L 

0017" 

T 

0019" 

Ll 

OOIB" 

^tebels ; 

130L 

013D' 

122L 

004A' 

123L 

0072' 

009B' 


127L 


OOFl 


128L 


0115  ' 


0136' 


!p300  SUBROUTINE  ARRAY (MODE, I ,J,N,M,S, D) 

[)400  DIMENSION  S(l)  ,D(1) 

0500  NI=N-I 

apeoo  IP(MODE-l) 100,100,120 

«700  100  IJ=I*J+1 

Ibsoo  NM=N*J+1 

20900  DO  110  K=1,J 

KOOO  NM=NM-NI 

100  DO  110  L=1,I 

21200  IJ=IJ-1 

|a.300  NM=NM-1 

R4OO  110  D(NM)=S{IJ) 

21500  GO  TO  140 

teOO  120  IJ=0 

700  NM=0 

800  DO  130  K=1,J 

21900  DO  125  K=1,I 

;;«2000  IJ=IJ+1 

,V2100  NM=NM+1 

22200  125  S(IJ)=D(NM) 

M300  130  NM=NM+NI 

^400  140  RETURN 

fi500  END 


l^ogram  Unit  Length=0168  (360)  Bytes 
b,kta  Area  Length=001B  (27)  Bytes 


kibroutines  Referenced: 


Variables : 


ibels 


fOOL 

140L 


612600 

22700 

>.!2800 

SS2900 

23000 

^100 

&200 

^300 

23400 

«^500 

i-aeoo 

23700 

|p800 


0001" 

I 

0003 

0007" 

M 

0009 

OOOD" 

NI 

OOOF 

0013" 

NM 

0015 

0019" 

003D' 

120L 

00E6 

0167  ' 

130L 

0148 

TiOOOOOO 


0005" 

OOOB" 

) 

0017" 


0011" 


llOL 

125L 


0093' 

OlOC 


SUBROUTI NE  MULTR (N , K , XBAR , STD , D , RX , RY , I SAVE , B , SB , T , ANS ) 
DIMENSION  XBAR(l) ,STD(1) ,D(1) ,RX(1) ,RY(1) ,ISAVE(1) ,B(1) ,SB(1) 
DIMENSION  T(1),ANS(1) 

DOUBLE  PRECISION  RX 
MM=K+1 

DO  100  J=1,K 
100  B(J)=0.0 

DO  110  J=1,K 
L1=K*(J-1) 

DO  110  1=1, K 
L=L1  +  I 

110  B(J)=B(J)+RY (I )*RX(L) 

RM=0.0 


S900 

24000 

MIOO 

^200 

24300 

24400 

■1500 

fneoo 

24700 

tsoo 

900 

25000 

flOO 
200 
25300 
35400 
#500 
*2S600 
25700 
%800 
56900 
26000 
«100 
I&200 

26300 

26400 

S5500 


56600 

26700 

taoo 

900 

27000 

.'7100 

J-7200 

2*7300 

^400 

#500 


B0*0.0 

L1=ISAVE(MM) 

DO  120  1=1, K 
RM=RM+B(I)*RY(I) 

L=ISAVE(I) 

B(I)=B{I)*STD(L1)/STD(L) 

120  BO=BO+B(I)*XBAR(L) 

B0=XBAR(L1)-B0 

SSAR=W4*D(L1) 

SSDR=D(L1)-SSAR 

FN=N-K-1 

SY=SSDR/PN 

DO  130  J=1,K 

Ll=K*(J-l)+J 

L=ISAVE(J) 

C  125  SB(J)=DSQRT{DABS( (RX{L1)/D(L))*SY) ) 
125  SB(J)=SQRT(ABS( (RX(Ll) /D(L) ) *SY) ) 
130  T(J)=B(J)/SB(J) 

135  SY=SQRT{ABS (SY) ) 

FK=K 

SSARM=SSAR/FK 
SSDRM=SSDR/FN 
F=SSARM/SSDRM 
ANS(11)=RM 
122  RM=SQRT(ABS(RM) ) 

ANS ( 1 ) =BO 

ANS(2)=RM 

ANS(3)=SY 

ANS(4)=SSAR 

ANS(5)=FK 

ANS(6)=SSARM 

ANS(7)=SSDR 

ANS(8)=FN 

ANS(9)=SSDRM 

ANS(10)=F 

RETURN 

END 


Program  Unit  Length=04B0  (1200)  Bytes 
^^ata  Area  Length=0065  (101)  Bytes 

Subroutines  Referenced; 


V^PRT 

$L1 


Variables : 

fc  00 


.>•;  010000 
'';020000 
SSDR 

§:000001 


0001" 

K 

000  3" 

XBAR 

0005 

0007" 

D 

0009" 

RX 

OOOB 

OOOD" 

I  SAVE 

OOOF" 

B 

0011 

0013" 

T 

0015" 

ANS 

0017 

0019" 

J 

OOIB" 

T;000000 

OOIF" 

I 

0021" 

L 

0023 

0025" 

RM 

0027" 

BO 

002B 

002P" 

T;030000 

00  31" 

SSAR 

0033 

0037" 

PN 

003B" 

SY 

003F 

0043" 

T.'OlOOOl 

0047" 

FK 

004B' 

OOID" 


•arm  00  4  F” 

T:040000 

®|070000 


00  5B" 
0061" 


SSDRM  0053" 

T;050000 

TtOSOOOO 


005D" 

0063" 


F  0057" 

T;060000 


005F" 


Labels: 

||0L 

"OL 

122L 


»'600 

27700 

S800 
900 
2"&000 
2^100 
^M200 
£^300 
28400 
.^500 
W600 
28700 
^800 
j&900 

Rooo 

29100 

»200 

i^300 

29400 

gd500 

•eoo 

29700 

;h8800 

t^900 

toooo 

^100 

1^200 

5^303 

30400 

>)500 

*■5)600 

30700 

^800 

•.V)900 

3-iOOO 

31100 

^200 

^300 

31400 

.5.500 

:-Jl600 

31700 

ri800 

||L900 

f^OOO 

^100 

$200 

96300 

32400 

|p500 


0020  • 
02F3' 
03B8  ' 


llOL 

125L 


007A' 
0290  • 


120L 

135L 


0194  ' 
0343' 


C  10 
10 
15 


SUBROUTINE  MINV(A,N,DD,L,M) 
DIMENSION  A(l) ,L{1) ,M(1) 

DOUBLE  PRECISION  A,D, BIGA,HOLD 

D=1.0 

NK=-N 

DO  80  K=1,N 

NK=NK+N 

L{K)=K 

M(K)=K 

KK=NK+K 

BIGA=A(KK) 

DO  20  J=K,N 
IZ=N*(J-1) 

DO  20  I=K,N 
IJ=IZ+1 

IF(DABS{BIGA)-DABS(A{IJ) ) )15,20,20 
IF(ABS(BIGA)-ABS(A(IJ) ) ) 15, 20, 20 
ABIGA=A(IJ) 

L(K)  =  I 
M(K)=J 
CONTINUE 
J=L{K) 

IF(J-K)35,35,25 
KI=K-N 
DO  30  1=1, N 
KI=KI+N 
HOLD=-A(KI ) 

JI=KI-K+J 
A(KI )=A(JI) 

A{JI)=HOLD 

I=M(K) 

IF(I-K)45,45,38 

JP=N*{I-1) 

DO  40  J=1,N 
JK=NK+J 
JI=JP+J 
HOLD=-A(JK) 

A(JK)=A(JI) 

A{JI)=HOLD 

IF(BIGA)48 ,46,48 

D=0.0 

GO  TO  150 

DO  55  1=1, N 

IF(I-K)50,55,50 

IK=NK+I 

A(IK)=A(IK)/(-BIGA) 

CONTINUE 
DO  65  1=1, N 
IK=NK+I 
HOLD=A{IK) 


m 


>4500 

;^56oo 

34700 

:m8oo 

^900 

Sooo 

35100 

1^200 

W300 

35400 

^500 

0600 

35700 

7^5800 

:$d900 

36000 

^100 

i|200 

5^300 

36400 

^^500 

^600 

36700 

1^800 

t4900 


IJ=I-N 
DO  65  J=1,N 
IJ=IJ+N 

IF(I-K)60,65,60 
60  IF(J-K)62,65,62 
62  KJ«IJ-I+K 

A ( IJ ) =HOLD* A (KJ ) +A ( IJ ) 
65  CONTINUE 
KJ=K-N 
DO  75  J=1,N 
KJ=KJ+N 

IF(J-K)70,75,70 
70  A(KJ)=A(KJ)/BIGA 
75  CONTINUE 
D=D*BIGA 
A(KK)=1.0/BIGA 
80  CONTINUE 
K=N 

100  K=(K-1) 

IF(K)150,150,105 
105  I=L(K) 

IF(I-K)120,120,108 
108  JQ=N*{K-1) 

JR=N*(I-1) 

DO  110  J=1,N 

JK*JQ+J 

HOLD=A(JK) 

JI»JR+J 

A(JK)=-A(JI) 

110  A(IJ)=HOLD 
120  J»M{K) 

IF(J-K)100,100,125 
125  KI=K-N 

DO  130  1=1, N 
KI=KI+N 
HOLD=A{KI ) 

JI=KI-K+J 
A(KI )=-A(JI) 

130  A(JI)=HOLD 
GO  TO  100 
150  CONTINUE 
DD=D 
RETURN 
END 


Program  Unit  Length=06C3  (1731)  Bytes 
l^ta  Area  Length=0047  (71)  Bytes 

Subroutines  Referenced: 


$AT 

'$T1 

$M9 

$DB 

Variables : 

§  0001" 

N 

L  0007" 

M 

|K  OOOF" 

K 

$L1 

$SB 

$MB 


0003" 

DD 

0005 

0009" 

D 

OOOB 

0011" 

KK 

0013' 

I  Vjp V. 


KGA 

T;020000 

\  ^010001 
[  HOLD 


Labels: 


QfflL 

15L 

150L 

e^5L 


t^5L 

llOL 

'V 

';-7000 

37100 

>7200 

j;Jf300 

*400 

37500 

K'^eoo 

^V700 

37800 

|Hf900 

ftooo 

38100 

5fl200 

^3300 

38400 

|£500 

^600 

^8700 

38800 

^3900 

Ssooo 

39100 
^200 
::9  3oo 
'39400 
3.9500 
!S;9600 
W700 
39800 
v9900 
>0000 
40100 
.;0200 
TO300 
T0400 
40500 

'.peoo 

■■4-0700 

40800 

B3900 


0015" 

T:000000 

OOID" 

J 

OOIF 

0023" 

IJ 

0025 

002B" 

ABIGA 

002F 

0035" 

JI 

0039 

00  3D  " 

IK 

003F 

0043" 

JR 

0045 

04EB' 

20L 

012D 

00E6' 

35L 

021C 

OlEB' 

45L 

02D9 

02A8  ' 

48L 

02F1 

06AE' 

55L 

033B 

0418' 

60L 

03B6 

04A2  ' 

70L 

0482 

051B' 

120L 

05  E4 

05B3  ' 

125L 

060E 

0019" 


T:010000 
IZ  0021" 

TrOOOOOl 
KI  0033" 

JP  003B" 

KJ  0041" 


OOIB" 


0027" 


lOL 

25L 

38L 

46L 

50L 

62L 

lOOL 

108L 

130L 


00B4' 
017F' 
0246  ' 
02  E2  ' 
0310  ' 
03CF' 
0509  ' 
0545  ' 
067A' 


SUBROUTINE  CORRE(N,M, 10 ,X , XBAR, STD, RX , R , B, D, T, NPRTM, NOPR ) 

DIMENSION  X(l) ,XBAR(1) ,STD(1) ,RX(1) ,R(1),B(1),D(1),T(1) 

DOUBLE  PRECISION  RX 

DO  100  J=1,M 

B(J)=0.0 

T(J)=0.0 

K=(M*M+M)/2 

DO  102  1=1, K 

R(I)=0.0 

FN=N 

L=0 

IF(IO)105,127,105 
DO  108  J=1,M 
DO  107  1=1, N 
L=L+1 

T(J)=T(J)+X(L) 

XBAR( J)=T(J) 

T(J)=T( J)/FN 
DO  115  1=1, N 
JK=0 
L=I-N 

DO  110  J=1,M 
L=L+N 

D(J)=X(L)-T(J) 

B( J)=B(J)+D(J) 

DO  115  J=1,M 
DO  115  K=1,J 
JK=JK+1 

R(JK)=R(JK)+D( J)*D(K) 

GO  TO  20b 

IF(N-M)130,130,135 

KK=N 

GO  TO  137 
KK=M 

DO  140  1=1, KK 

CALL  DATA  (M , D , NPRTM , NOPR ) 

DO  140  J=1,M 
T(J)=T(J)+D( J) 

L=L+1 

RX(L)=D(J) 


|.000 

41100 

^200 

B.300 

^400 

K500 
600 
700 
41800 

t900 
000 
42100 
4,2200 
^300 
42400 
42500 

>i^600 

“^2700 

42800 

.:i2900 

Njaooo 

43100 
^^200 
1^300 


1400 


4^500 

V^600 

!W700 

43800 

^900 

pooo 

44100 

fl^4200 

'::4300 

45400 

44500 

l|600 

V1700 

44800 

•^-4900 

^^000 

45100 

«200 

.f3300 

45400 

45500 

'-.^600 

$5700 

45800 

^^•5900 


t^OOO 


46100 

i6200 

P300 

t6400 

46500 

(>&600 

.^700 

46800 


FKK=KK 

DO  150  J=1,M 
XBAR(J)=T(J) 

150  T(J)=T(J)/FKK 
L=0 

DO  180  1=1, KK 
JK=0 

DO  170  J=1,M 
L=L+1 

170  D(J)=RX(L)-T(J) 

DO  180  J=1,M 
B(J)=B(J)+D(J) 

DO  180  K=1,J 
JK=JK+1 

180  R{JK)=R(JK)+D( J)*D(K) 
IF(N-KK)205,205,185 
185  KK=N-KK 

DO  200  1=1, KK 
JK=0 

CALL  DATA  (M , D, NPRTM , NOPR ) 

DO  190  J=1,M 

XBAR( J)=XBAR(J)+D(J) 

D(J)=D(J)-T(J) 

190  B(J)=B(J)+D(J) 

DO  200  J=1,M 
DO  200  K=1,J 
JK=JK+1 

200  R(JK)=R(JK)+D(J)*D(K) 

205  JK=0 

DO  210  J=1,M 
XBAR ( J ) =XBAR ( J ) /FN 
DO  210  K=1,J 
JK=JK+1 

210  R(JK)=R(JK)-B{J)*B{K)/FN 
JK=0 

DO  220  J=1,M 
JK=JK+J 

220  STD(J)=SQRT(ABS (R( JK) ) ) 

DO  230  J=1,M 
DO  230  K=J,M 
JK=J+(K*K-K)/2 
L=M*(J-1)+K 
RX(L)=R(JK) 

L=M* (K-l)+J 
RX(L)=R{JK) 

IF(STD(J)*STD(K) )225,222,225 
222  R(JK)=0.0 
GO  TO  230 

225  R{JK)=R(JK)/(STD(J)*STD(K) ) 
230  CONTINUE 

FN=SQRT(FN-1.0) 

DO  240  J=1,M 
240  STD( J)=STD( J)/FN 
L=-M 

DO  250  1=1, M 
L=L+M+1 
250  B(I)=RX(L) 

RETURN 

END 


^ogram  Unit  Length=0A03  (2563)  Bytes 
Data  Area  Length=0043  (67)  Bytes 


babroutines  Referenced; 


^RT 

ABS 

$AT 

Kl 

$T1 

$M9 

fD9 

$CA 

$AB 

$DB 

$SB 

$NB 

r« 

DATA 

m 

Variables : 

S 

0001" 

M 

000  3" 

10 

0005 

0007" 

XBAR 

0009" 

STD 

OOOB 

RX 

OOOD" 

R 

OOOF" 

B 

0011 

0013" 

T 

0015" 

NPRTM 

0017 

tfepR 

0019" 

J 

OOIB" 

TtOOOOOO 

K 

OOIF" 

I 

0021" 

FN 

0023 

0027" 

T;010000 

0029" 

JK 

002B 

'  >.<: 020000 

002D" 

KK 

002F" 

FKK 

0035 

T:030000 

00  3D" 

T;000001 

003F" 

bels: 


lOOL 

0032  ' 

102L 

0088  • 

105L 

OODB' 

ilfe7L 

02CF' 

108L 

0155' 

107L 

00  EE' 

1a.5L 

024F' 

llOL 

OlFD' 

205L 

06C4' 

130L 

02F2  ' 

135L 

02  FF' 

137L 

0309  ' 

^OL 

0360' 

150L 

03E3' 

180L 

04C7  ’ 

VOL 

0436  ' 

185L 

055F' 

200L 

064B' 

190L 

05F9  ’ 

210L 

06FD' 

220L 

0786’ 

j?^0L 

r^40L 

0918  ' 

225L 

08C7  ' 

222L 

08A7  ’ 

0964  ' 

250L 

09BE' 

i 


5 


i 


■< 

V 


I 


00  ID" 


•Td 


TORTRAN 

Created 

^100 

n200 

(rbaoo 

K400 
500 
600 
00700 

tsoo 

900 

01000 

illoo 

35.200 

ai300 

ai400 

M.500 

^600 

01700 

ft.800 

yO-900 

02000 

MlOO 

I&200 

0^300 

0.2400 

t-:2500 

t.2600 

02700 

8800 
900 
03000 
^'^lOO 
,‘v^200 
?J3300 
IU400 
TO500 
tv3600 
03700 
^800 
1«900 
04000 
WlOO 
>^4200 
04300 
A4400 
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#}600 
04700 
34800 
^900 
05000 

KlOO 
200 
300 
05400 
;ffi500 
>^5600 
05700 

e800 


-80  Ver.  3.4  Copyright  1978,  79, 
;  26-NOV-80 


(C)  By  Microsoft  —  Bytes:  22872 


OF  VARS,  INDEX  OF  DEP  VAR/ 
;iON:  ) 


PROGRAM  ITEREG 

ESTIMATES  REGRESSION  COEFFTS  BY  AN  ITERATIVE  "WALD/ BARTLETT" 
METHOD. 

INPUT  DATA  READ  FROM  FILE. 

DIMENSION  Y(IOO) , XX (100, 11) ,X ( 100 ) ,XORD( 100) ,INDX{100) 

DIMENSION  IRANKdOO)  ,XBAR(11)  ,IIND(10)  ,IX(100,10)  ,COEF(ll) 
DIMENSION  RIQ(ll) ,RMED(11) 

DOUBLE  PRECISION  FNAMl { 2 ) 

DATA  LUN1,LUN2,LUN6/1,2,6/ 

LUNl: TERM INAL;  LUN 2; PRINTER;  LUN6:DATA  FILE  FOR  OUTPUT 
WRITE(LUN2,80) 

80  FORMAT (15 HO  PROGRAM  ITEREG,/ 

157H  ESTIMATES  REGRESSION  COEFFICIENTS  BY  AN  ITERATIVE  METHOD) 
WRITE (LUNl, 90) 

90  FORMAT (53H  ENTER  FILE  NAME,  1-16  ALPHA  CHARS,  LAST  CHAR  BLANK:  ) 
READ( LUNl , 95 ) FNAMl 
95  FORMAT ( 2 A8) 

CALL  OPEN (LUN 6, FNAMl, 80) 

WRITE ( LUN  2,209) FNAMl 
209  FORMAT ( 17 HODATA  FILE  NAME:  ,2A8) 

WRITE (LUNl, 201) 

201  FORMAT(46H  ENTER  NO  OF  OBS ,  NO  OF  VARS,  INDEX  OF  DEP  VAR/ 

13 3H  NO  OF  INDEP  VARS  IN  REGRESSION:  ) 

READ( LUNl, 18) NOBS, NVAR,IDEP,NIND 

18  FORMAT(16I5) 

WRITE(LUN2,16) 

16  FORMAT (40H0 NO  OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR/ 

149H  IN  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION...) 

10-0 

WRITE ( LUN2 , 1 3 ) NOBS , NVAR , IDEP , NIND 
13  FORMAT(1X,8I10) 

WRITE(LUN1,19) 

19  FORMAT(44H  ENTER  INDICES  OF  INDEP  VARS,  IN  ASC  ORDER:  /) 
READ(LUN1,18) ( IIND( I ) , 1=1 ,NIND) 

WRITE(LUN2,215) 

215  F0RMAT(39H  INDICES  OF  INDEP  VARS  IN  REGRESSION...) 

WRITE(LUN2,13) (IIND(I ) ,1=1, NIND) 

WRITE (LUNl, 221) 

221  FORMAT (33H  ENTER  MAX  NO  OF  OBSNS  TO  PRINT;  ) 

READ( LUNl , 222 ) NPRTM 

222  FORMAT(I4) 

NPRTM=MIN0 (NPRTM, NOBS ) 

NOPR=0 

WRITE( LUNl, 230) 

230  FORMAT(49H  INPUT  0  TO  USE  WALD  EST,  1  TO  USE  BARTLETT  EST;  ) 

READ( LUNl, 231 )IWB 

231  FORMAT(Il) 

IF(IWB.GT.0)GO  TO  232 
WRITE(LUN2,233) 

233  FORMAT (56H0 REGRESSION  METHOD  USED  AT  EACH  ITERATION;  WALD'S  METHOD 

1) 

GO  TO  234 

232  CONTINUE 
WRITE(LUN2,235) 

235  FORMAT ( 60H0 REGRESSION  METHOD  USED  AT  EACH  ITERATION:  BARTLETT'S  ME 
ITHOD) 

234  CONTINUE 
WRITE(LUN1,1239) 


IN  REGRESSION...) 


TO  USE  BARTLETT  EST; 


1 


S900 

06000 

glOO 
200 
06300 

X400 
500 
600 
^700 
#800 
iE900 
07000 
IflOO 
®200 
07300 
07400 
i»500 
§7600 
07700 
§^800 
^900 
08000 
ftfllOO 
(^200 
(re300 
08400 
^5500 
(ftieoo 
08700 

*800 
900 
09000 
(WlOO 
^200 
09300 
y4oo 
#500 
&9600 
09700 
f*.J800 

(•>hoo 

10000 

^100 

^*^3200 

lt)300 

10400 

®500 


!«)600 

10700 

^800 

;‘O900 

11000 

^100 

B-200 

n3oo 

U400 

.^500 

5^1600 

11700 

||.800 


1239  PORMAT(36H  ENTER  NO  OF  ITERATIONS  TO  PERFORM:  ) 

READ ( LUNl , 12 50 ) NI TER 
1250  FORMAT(I3) 

WRITE(LUN1,1264) 

1264  FORMAT(44H  PRINT  RESULTS  EVERY  N  ITERATIONS;  INPUT  N:  ) 
READ(LUN1,1250)NPRT 
KPRT=0 

WRITE(LUN2,1262)NPRT 

1262  FORMAT (19H0 PRINT  AFTER  EVERY  ,I3,11H  ITERATIONS) 
WRITE(LUN2,12) 

12  FORMAT ( 8H0DATA . . . ) 

WRITE(LUN2 , 223 )NPRTM 

223  FORMAT (8H  (FIRST  ,I4,7H  OBSNS)) 

:  READ  DATA 

NINDP=NIND+1 
DO  1110  J=l,NOBS 
READ(LUN6,10) ( X ( I ) , 1=1 ,NVAR) 

10  FORMAT (8F10. 4) 

NOPR=NOPR+l 

IF(NOPR.GT.NPRTM)GO  TO  224 
WRITE(LUN2,11 ) ( X ( I ) , 1=1 ,NVAR ) 

224  CONTINUE 

11  FORMAT(1X,8F10.4) 

Y(J)=X(IDEP) 

DO  1112  I=1,NIND 
INDEX=IIND(I ) 

XX(J,I)=X(INDEX) 

1112  CONTINUE 

XX(J,NINDP)=X(IDEP) 

1110  CONTINUE 
:  COMPUTE  MEANS 

DO  1120  I=1,NINDP 
COEF(I)=0.0 
XBARd  )=0.0 
DO  1130  J=l,NOBS 
XBARd  )=XBAR(I  )+XX(J,I) 

1130  CONTINUE 

XBAR ( I ) =XBAR ( I ) /NOBS 
1120  CONTINUE 

WRITE(LUN2,32) 

32  FORMAT ( 3 5H0 MEANS  FOR  VARIABLES  IN  REGRESSION,/ 

127 H  DEPENDENT  VARIABLE  LAST...) 

WRITE(LUN2,20)  (XBARd  )  ,I=1,NINDP) 

20  FORMAT (IX, 8 FIO. 4) 

COEF(NINDP)=XBAR(NINDP) 

:  ORDER  THE  X'S 

DO  1160  I=1,NIND 
DO  1170  J=l,NOBS 
X(J)=XX(J,I) 

1170  CONTINUE 

CALL  RANK{X,NOBS,XORD,INDX,IRANK) 

DO  1180  J=l,NOBS 
XX(J,I)=XORD(J) 

IX(J,I)=INDX(J) 

1180  CONTINUE 

:  DETERMINE  MEDIAN,  INTERQUARTILE  RANGE 
IQ1=.25*N0BS 
IQM=.5*NOBS 
IQ2=.75*NOBS 
RMEDd  )=XORD(IQM) 


^900 

12000 

^100 

p200 

12300 

12400 

Bsoo 

ileoo 

12700 

S800 
900 
13000 

8100 
200 
300 
13400 
P500 
^600 
13700 
?3800 
^900 
14000 
i4100 
IJ200 
ft300 
14400 
{^500 
C-<1l600 
14700 
^800 
9900 
15000 
15100 
J:-5200 
1^300 

!5400 
E500 
o600 
15700 
tB800 
I1&900 
16000 
#100 
K6200 
16300 
,1^400 
i^500 
¥6600 
16700 

<o800 

1?5900 

17000 

WlOO 

#200 

T7300 

17400 

5*^500 

&7600 

17700 

B7800 


1210 


1211 


1212 


RIQ(I)=X0RD(IQ2)-X0RD(IQ1) 

1160  CONTINUE 

CALL  RANK (Y, NOBS, XORD,INDX,IRANK) 

RMED ( NI NDP ) =XORD ( IQM ) 

RIQ (NINDP ) =XORD( IQ2 ) -XORD ( IQl ) 

WRITE(LUN2,22) 

22  FORMAT ( IIHOMEDI ANS. .. ) 

WRITE(LUN2,20 ) (RMED(I ) ,1=1, NINDP) 

WRITE(LUN2,21) 

21  FORMAT ( 24H0 INTERQUARTILE  RANGES. . . ) 

WRITE(LUN2,20) (RIQ (I ) ,1=1 , NINDP) 

WRITE(LUN2,1235) 

1235  FORMAT (38H0 REGRESSION  COEFFICIENTS,  INTERCEPT...) 
IF(NITER.LE.O)GO  TO  126 

125  CONTINUE 

DO  1300  ITER=1, NITER 
C  COMPUTE  REGRESSION  COEFFT  ADJUSTMENTS 
DO  1200  I=1,NIND 
DO  1210  J=l,NOBS 
X(J)=XX(J,I) 

INDX(J)=IX(J, I) 

1210  CONTINUE 
IF(IWB.GT.0)GO  TO  1211 
CALL  WALD(Y,X,NOBS,INDX,A) 

GO  TO  1212 

1211  CONTINUE 

CALL  BART(Y,X,NOBS,INDX,A) 

1212  CONTINUE 

C  ADJUST  REGRESSION  COEFFT 
COEFd  )=COEF(I  )+A 
C  ADJUST  Y 

DO  1220  J=l,NOBS 
IY=IX(J,I) 

Y{IY)=Y(IY)-A*XX{J,I) 

1220  CONTINUE 
C  ADJUST  CONSTANT  TERM 

COEF (NI NDP ) =COEF (NINDP ) -A*XBAR ( I ) 

1200  CONTINUE 
C  PRINT  RESULTS 
KPRT=KPRT+1 

IF(KPRT.LT.NPRT)GO  TO  1270 
KPRT=0 

WRITE(LUN2,1230) ( COEF ( I ), 1=1 , NINDP ) 

1230  FORMAT(1X,8F10.4) 

1270  CONTINUE 
1300  CONTINUE 

WRITE(LUN1,1240) 

1240  FORMAT(4lH  ENTER  NO  OF  ADDL  ITERATIONS  TO  PERFORM; 
READ(LUN1, 1250) NITER 
IF(NITER.LE.0)GO  TO  126 
WRITE(LUN1,1264) 

READ( LUNl , 1250 )NPRT 
WRITE(LUN2,1262)NPRT 
GO  TO  125 

126  CONTINUE 

C  ESTIMATE  RESIDUAL  VARIANCE 
C  PUT  THE  X’S  BACK  IN  ORIGINAL  ORDER 
DO  1500  I=1,NIND 
DO  1510  J=l,NOBS 
X(J)=XX(J,I) 


V900  1510  CONTINUE 

18000  DO  1520  J=l,NOBS 

MlOO  JORIG=IX(J,I) 

S2OO  XX(JORIG,I)=X(J) 

K300  1520  CONTINUE 

3^400  1500  CONTINUE 

B5OO  YINT=COEF(NINDP) 

Reoo  Rv«=o.o 

18700  DO  1400  J=l,NOBS 

SaSOO  YEST=YINT 

DB900  DO  1410  I=1,NIND 

19000  YEST=YEST+COEF{I)*XX(J,I) 

|£100  1410  CONTINUE 

R2OO  RV=RV+(XX(J,NINDP)-YEST)**2 

19300  1400  CONTINUE 

19400  RV=RV/(NOBS-NIND-l) 

1©500  RSD=SQRT(RV) 

^600  WRITE(LUN2,1420)RV,RSD 

19700  1420  FORMAT (21H0 RESIDUAL  VARIANCE  =  ,F14.4/ 

IB800  127H  RESIDUAL  STANDARD  ERROR  =  /FI  4. 4) 

£9900  C  COMPUTE  COEFFT  OF  DETERMINATION 

20000  VY=0.0 

MlOO  DO  1430  J=l,NOBS 

§200  VY=VY+XX(J,NINDP)**2 

“300  1430  CONTINUE 

20400  VY=(VY-NOBS*XBAR(NINDP)**2)/(NOBS-l) 

jl^soo  CD=(vy-Rv)/vy 

iij^eoo  WRITE(LUN2,1435)CD 

20700  1435  F0RMAT(32H  COEFFICIENT  OF  DETEIMINATION 

£)800  END 


=  ,F10.4) 


Program  Unit  Length=0C0E  (3086)  Bytes 
jl^ta  Area  Length=2555  (9557)  Bytes 

^broutines  Referenced: 


$W2 

^NK 

.^RT 

'ibB 


$13 

SQRT 

$ND 

$L1 

$AB 

$MA 

$NB 

$MB 

$EX 


$11 

$INIT 

$R2 

$T1 

$DA 

$CH 

WALD 

$EA 


riables: 


Y 

0001" 

XX 

0191" 

X 

12C1 " 

:?'pRD 

1451" 

INDX 

15  El " 

I  RANK 

16  A9" 

'^AR 

1771" 

IIND 

179D" 

IX 

17B1" 

COEF 

1F81" 

RIQ 

IFAD" 

RMED 

1FD9" 

^AMl 

2005" 

LUNl 

2015" 

LUN2 

2017" 

wJN6 

2019" 

NOBS 

211E" 

NVAR 

2120" 

%EP 

2122" 

NIND 

2124" 

10 

2194" 

I 

21D7  " 

T:000000 

21D9" 

NPRTM 

222D" 

^PR 

2233" 

IWB 

226B" 

T;000002 

2271 

.JSlTER 

2319" 

NPRT 

2350" 

KPRT 

2352" 

NINDP 

23A3" 

J 

23A5" 

INDEX 

23BA" 

ei:oioooo 

23BC" 

T:020000 

2  3  BE  " 

T:030000 

2418 

tbels : 


^  A  4 


*900 
000 
21100 
5^200 
OL300 
‘21400 

tsoo 
600 
21700 
21800 
^4900 
a2000 
22100 
«200 
;^300 
^2400 
,22500 

vSzeoo 

K700 

22800 

&900 

>■>•3000 

23100 

«3200 

jP300 

73400 

^3500 

^3600 

^r'3700 

23800 


241A" 

IQM 

241 C" 

IQ  2 

241E" 

247E'' 

A 

2480" 

lY 

2490" 

24CB'’ 

YINT 

24CD" 

RV 

24D1" 

24D5’' 

2526" 

RSD 

24D9" 

VY 

2522" 

0006' 

80L 

201B" 

90L 

20  6D" 

20A7  " 

209L 

20  AC" 

201L 

20  C6  " 

212C" 

16L 

2132" 

13L 

219C" 

21 A5" 

215L 

21DB" 

221L 

2207" 

222F" 

230L 

2235" 

231L 

226D" 

OlBl* 

233L 

227  2" 

234L 

OlBD' 

22AF" 

1239L 

22F0  " 

1250L 

231B" 

231F" 

1262L 

2354" 

12L 

237  E" 

238A" 

lllOL 

0389  ' 

lOL 

23A7" 

02  E6' 

IIL 

23AF" 

1112L 

0349  ' 

0437  ’ 

1130L 

040C' 

32L 

23  CO" 

2407" 

1160L 

05  FA' 

1170L 

04E3' 

0562  ' 

22L 

2426" 

21L 

2436" 

245  3" 

126L 

096B' 

125L 

0714' 

08E7  ' 

1200L 

0877  ' 

1210L 

077D' 

07B0' 

1212L 

07BC' 

1220L 

083A' 

08 E7  ' 

1230L 

2492" 

1240L 

249D" 

0A16' 

1510L 

09A2' 

1520L 

0A06  ' 

OADE  ' 

1410L 

0A98  ' 

1420L 

24DD" 

0B7C'  1435L  252A" 

SUBROUTI NE  RANK ( X , N , XORD , INDX , IRANK ) 

C  THIS  SUBROUTINE  DETERMINES  THE  RANKS  OF  THE 

COMPONENTS  OF  THE 

C  VECTOR 

X,  AND  ALSO 

ORDERS  THESE 

COMPONENTS 

(ASCENDING  ORDER). 

C  INPUT; 

C  OUTPUT; 

VECTOR  X(N),  DIMENSION  N 

RANKS  OF  ORIGINAL  X'S  ARE  STORED  IN 

VECTOR  IRANK 

C  ORDERED  X'S  ARE  STORED  IN  XORD 

C  ORIGINAL  INDICES  OF  ORDERED  X'S 

ARE  STORED  : 

IN  INDX 

DIMENSION  X(l) ,XORD(l) , IRANK( 1 ) , INDX ( 1 ) 
DO  50  1=1, N 
INDXd  )  =  I 
50  XORD(I)=X(I) 

NM1=N-1 

DO  100  1=1, NMl 
IP1=I+1 

DO  200  J=IP1,N 

IF(XORD{I) .LE.XORD(J) )GO  TO  200 
TEMP=XORD(I ) 

XORDd  )=XORD(J) 

XORD{J)=TEMP 
ITEMP=INDXd  ) 

INDXd  )  =  INDX(J) 

INDX(J)=ITEMP 
200  CONTINUE 
100  CONTINUE 

DO  300  1=1, N 
Il=INDXd  ) 

IRANK(I1)=I 
300  CONTINUE 
RETURN 
END 


l-ogram  Unit  Length=OlAE  (430)  Bytes 
lata  Area  Length=0028  (40)  Bytes 

Lbroutines  Referenced: 


h/ari  ablest 


$L1 


$T1 


0001" 

N 

0003" 

XORD  0005" 

INDX 

0007" 

IRANK 

0009" 

I  OOOB" 

•  000000 

OOOD" 

TtOlOOOO 

OOOP" 

NMl  0011" 

^1 

0013" 

J 

0015" 

T;000002 

0017" 

T^IMP 

0018" 

ITEMP 

OOlC" 

T:020000 

OOIE" 

T.:030000 

0020" 

T;040000 

0022" 

TtOSOOOO 

0024" 

§■ 

0026" 

Labels: 

002D' 

lOOL 

015C' 

200L  0148' 

300L 

0199  ' 

900 
000 
?j^l00 

144200 

^300 

24400 

msoo 
•  600 
24700 
l?d800 
::^9oo 
25000 
25100 
•  200 
S?5300 
25400 
^•35500 

ijieoo 

25700 
^800 
>^900 
iroOOO 
26100 
^200 
Se300 
26400 
QS500 
So600 
26700 
800 
900 


SUBROUTINE  WALD( Y , X, N , INDX, A ) 

C  THIS  SUBROUTINE  COMPUTES  THE  WALD  ESTIMATE  A  IN  THE  REGRESSION 
C  EQUATION  Y=A*X. 

C  INPUT  VECTOR  Y(DEP  VAR),  VECTOR  X(INDEP  VAR),  VECTOR  INDX 
C  (ORIGINAL  INDICES  OF  THE  ORDERED  X'S,  BEFORE  ORDERING),  DIMENSION  N 
C  OUTPUT  REGRESSION  COEFFT  A 
C  X-DATA  ARE  ORDERED  (ASCENDING  ORDER) 

C  INDX  CONTAINS  INDICES  OF  Y  CORRESP  TO  EACH  X 
DIMENSION  Y(l) ,X(1) ,INDX(1) 

C  XBARl  =  MEAN  OF  X’S  BELOW  MEDIAN 
C  XBAR2  =  MEAN  OF  X'S  ABOVE  MEDIAN 
XBAR1=0.0 
YBAR1=0.0 
XBAR2=0.0 
YBAR2=0 . 0 
N2=N/2 

DO  100  1=1, N2 
XBAR1=XBAR1+X(I ) 

IY=INDX(I ) 

YBAR1=YBAR1+Y(IY) 

100  CONTINUE 

C  LEAVE  OUT  MIDDLE  OBSN  IF  N  IS  ODD 
N3=N-N2+1 
DO  200  I=N3,N 
XBAR2=XBAR2+X(I ) 

IY=INDX(I ) 

YBAR2=YBAR2+Y(IY) 

200  CONTINUE 

A= ( YBAR2-YBAR1 ) / (XBAR2-XBAR1 ) 

RETURN 

END 


Program  Unit  Length=0157  (343)  Bytes 
^ta  Area  Length=0027  (39)  Bytes 


Subroutines  Referenced; 

I 


I.T 

$D9 

SUB 


$L1 

$AB 


$T1 

$SB 


0001" 

X 

0003" 

N 

0005 

0007" 

A 

0009" 

XBARl 

OOOB 

OOOF" 

XBAR2 

0013" 

YBAR2 

0017 

00  IB" 

I 

OOID" 

lY 

OOIF 

0021" 

T:000001 

0023" 

tebels: 

xbOL  009F'  200L  0114' 


5:7000 
iWlOO 
27200 
^300 
i;7400 
27  500 
S-7600 
^700 
^800 
3,7900 

•wooo 

5?8100 

28200 

^300 

P400 

28500 

as  600 

^^8700 

•^'8800 

^900 

ftooo 

k'aioo 

29200 

:-j9300 

^^400 

29500 

#0600 


SUBROUTINE  BART ( Y ,X, N, INDX,A) 

C  THIS  SUBROUTINE  COMPUTES  THE  BARTLETT  ESTIMATE  A  IN  THE  REGRESSION 
C  EQUATION  y=A*X. 

C  THE  X-DATA  ARE  ORDERED  (ASCENDING  ORDER) 

C  INDX  CONTAINS  INDICES  OF  Y  CORRESPONDING  TO  EACH  X 
C  INPUT  VECTOR  Y(DEP  VAR),  VECTOR  X(INDEP  VAR),  DIMENSION  N 
C  OUTPUT  REGRESSION  COEFFT  A 

DIMENSION  Y(l) ,X(1) ,INDX(1) 

N3=N/3 

XBAR1=0 . 0 

YBAR1=0.0 

XBAR2=0.0 

YBAR2=0 . 0 

DO  100  1=1, N3 

XBAR1=XBAR1+X(I) 

IY=INDX(I ) 

YBAR1=YBAR1+Y{IY) 

100  CONTINUE 
N2=N~N3+1 
DO  200  I=N2,N 
XBAR2=XBAR2+X{I ) 

IY=INDX(I ) 

YBAR2=YBAR2+Y ( lY ) 

200  CONTINUE 

A=(YBAR2-YBAR1)/(XBAR2-XBAR1) 

RETURN 

END 


Program  Unit  Length=0157  (343)  Bytes 
J7,ata  Area  Length=0027  (39)  Bytes 

^broutines  Referenced: 


$DB 

^riables : 

$D9 

$AB 

$L1 

$SB 

X 

0001" 

X 

0003" 

N 

0005" 

J^NDX 

•TSBARI 

0007" 

A 

0009" 

N3 

OOOB" 

GOOD" 

YBARl 

0011" 

XBAR2 

0015" 

YBAR2 

m2 

0019" 

0021" 

I 

T;000001 

OOID" 

0023" 

lY 

OOIF" 

009F' 


200L 


0114' 


I 

Labels! 


OL 


I 


Appendix  B 

Computer  Program  Output 


I 


^OGRAM  SIMULA 

GENERATES  MULTIVARIABLE  UNIVARIATE  SAMPLE 
J|[LE  NAiME:  DPI  LI 
DIMENSION  (M)  =  1 

Ei 

SlEFFICIENT  MATRIX  (C) . . • 

COEFFT,  MATRIX  IS  IDENTITY  MATRIX 

^GMA  (SCALING  PARAMETER  FOR  INDEP  VARIABLES) 
BO  (INTERCEPT)  =  1.0000 

n 

iD\RAMETER  VECTOR  ( B )  .  .  . 

1.0000 

^<IG  (MODEL  ERROR  STD  DEV)  =  1.0000 


NO  OF  OBSNS  =  100 
^<IP  0  RANDOM  NOS  PRIOR  TO  SIMULATION 

VARIANCE  MATRIX  OF  X,  Y... 


S'  100.0000  100.0000 

"^  100.0000  101.0000 

jOEFFICIENT  OF  DETERMINATION  = 

ORRELATION  MATRIX  OF  X,  Y... 
^  1.0000  .9950 


.9901 


i\  ■ 


9950  1.0000 


OBSERVATIONS. . . 
ftpENDENT  VARIABLE  LAST 
i^IRST  3  OBSNS  PRINTED) 
OBSN  NO  1 : 

f.;  -18.3276  -18.8362 

&BSN  NO  2 : 

-13.7183  -13.1409 

flpSN  NO  3 ; 

y.'  13.3911  12.3044 


;OGRAM  MREG 

ILTIPLE  LINEAR  REGRESSION  ANALYSIS 


TA  FILE  NAME:  DFILl 

OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION... 
100  2  2  1 
DICES  OF  INDEP  VARS  IN  REGRESSION... 


1 


(FIRST  3 
-18.3276 
vvj  -13.718  3 
13.3911 


OBSNS ) 
-18.8362 
-13.1409 
12.3044 


^ANS  FOR  ALL  VARIABLES... 
1.2364  2.2018 


rANDARD  DEVIATIONS  FOR  ALL  VARIABLES... 
9.4218  9.5641 


QDRRELATION  MATRIX... 

5?  1,0000 

.9932  1.0000 


f 


TERMINANT  =  .lOOOE+01 


INVERSE  OF  THE  PART  OF  THE  CORR  MATRIX  USED  IN  REGRESSION. 

1.0000 

PRODUCT  OF  MATRIX  AND  INVERSE... 

OULD  BE  IDENTITY  MATRIX.  IF  NOT,  ROin^JDOFF  ERRORS 
VE  RUINED  SOLUTION...  DISREGARD  REST  OF  ANALYSIS. 

1.0000 


•VNVERSE  OF  THE  PART  OF  THE  CROSS  PROD  MATRIX  USED  IN  REGRESSION.. 
Ct  .1138E-03 


«DEX  OF  DEP  VAR  IN  REGRESSION  = 
wJNDICES  OF  INDEP  VARS  IN  REGRESSION... 
1 


REGRESSION  COEFFICIENTS... 


a  1.0082 

•^-VALUES.  .  . 

84.2544 

»5TD  DEVS  OF  REG  COEFS.. 

I* jf  n 


»  .0120 

INTERCEPT  =  .9553 

RESIDUAL  STANDARD  ERROR  =  1.1217 

Sample  multiple  correlation  coefficient  = 

SAMPLE  COEFFICIENT  OF  DETERMINATION  = 


.9932 
.9864 

-■^UM  of  SQUARES  ATTRIBUTABLE  TO  REGRESSION,  SSAR  = 
^tEGREES  OF  FREEDOM  ASSOCIATED  WITH  SSAR  =  1.0000 

'Kean  square  of  SSAR  =  8932.3516 

SUM  OF  SQUARES  OF  DEVIATIONS  FROM  REGRESSION,  SSDR  = 
3GREES  OF  FREEDOM  ASSOCIATED  WITH  SSDR  =  98,0000 


8932.3516 

123.312 


r 


1.2583 


ILan  square  of  ssdr  = 
F-VALUE  =  7098.7979 


I 


§ 

1 


I 


^OGRAM  ITEREG 

ESTIMATES  REGRESSION  COEFFICIENTS  BY  AN  ITERATIVE  METHOD 
m^TA  FILE  NAME:  DFILl 

NO  OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
^  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION... 

Bi  100  2  2  1 

INDICES  OF  INDEP  VARS  IN  REGRESSION... 

i 

REGRESSION  METHOD  USED  AT  EACH  ITERATION:  WALD'S  METHOD 
:-HINT  AFTER  EVERY  1  ITERATIONS 
DATA. . . 

AFIRST  3  OBSNS) 

•^'-18.3276  -18.8362 

:  '• -13.7183  -13.1409 

13.3911  12.3044 

SibANS  FOR  VARIABLES  IN  REGRESSION, 

DEPENDENT  VARIABLE  LAST... 

.V  1.2364  2.2018 

•  MEDIANS... 

^  .9692  1.7888 

TOTERQUARTILE  RANGES... 

:  12.9687  13.4609 

'  *>! 

;  'DEGRESSION  COEFFICIENTS,  INTERCEPT... 


1.0062 

.9577 

9 

1.0062 

.9577 

1.0062 

.9577 

1.0062 

.9577 

•V 

1.0062 

.9577 

1.0062 

.9577 

1.0062 

.9577 

1.0C62 

.9577 

1.0062 

.9577 

1.0062 

.9577 

.‘'“ESI DUAL  VARIANCE  =  1.25  87 

Residual  standard  error  =  1.1219 

COEFFICIENT  OF  DETERMINATION  .98  62 


tOGRAM  ITEREG 

TIMATES  REGRESSION  COEFFICIENTS  BY  AN  ITERATIVE  METHOD 
H^TA  FILE  NAME;  DFILl 

NO  OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 

f  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION... 

100  2  2  1 
INDICES  OF  INDEP  VARS  IN  REGRESSION... 

g|  1 

REGRESSION  METHOD  USED  AT  EACH  ITERATION:  BARTLETT'S  METHOD 


S^INT  AFTER  EVERY  1  ITERATIONS 

ft! 

DATA . . . 

f‘;FIRST  3  OBS  NS) 

-18.3276  -18.8362 

-13.7183  -13.1409 

13.3911  12.3044 

SlANS  FOR  VARIABLES  IN  REGRESSION, 
DEPENDENT  VARIABLE  LAST... 

'p  1.2364  2.2018 

MEDIANS . . . 

m  .9692  1.7888 


INTERQUARTILE  RANGES. 
12.9687  13.4609 


N-EGRESSION 
1.0103 
P  1.0103 
1.010  3 
1.0103 
1.0103 
1.010  3 
1.0103 
^  1.0103 

1.010  3 
1.010  3 


COEFFICIENTS, 
.9527 
.9527 
.9527 
.9527 
.  9527 
.9  527 
.  9  527 
.9527 
.9527 
.9527 


INTERCEPT. 


:>f:SIDUAL  VARIANCE  =  1.2587 

i&SIDUAL  STANDARD  ERROR  = 
COEFFICIENT  OF  DETERMINATION  = 


1.1219 

.9862 


PROGRAM  SIMULA 

MENERATES  MULTIVARIABLE  UNIVARIATE  SAMPLE 


FILE  NAME:  DPILl 


JtMENSION  (M)  =  1 

COEFFICIENT  MATRIX  (C) . . . 

IS  COEFFT  MATRIX  IS  IDENTITY  MATRIX 


SIGMA  (SCALING  PARAMETER  FOR  INDEP  VARIABLES)  = 


2.0000 


(INTERCEPT)  = 


1.0000 


PARAMETER  VECTOR  (B).. 

i  1.0000 


SIG  (MODEL  ERROR  STD  DEV)  =  1.0000 

OF  OBSNS  =  100 

SKIP  0  RANDOM  NOS  PRIOR  TO  SIMULATION 

I^RIANCE  MATRIX  OF  X,  Y... 

4.0000  4.0000 
4.0000  5.0000 


c-OEFFICIENT  OF  DETERMINATION 


^RRELATION  MATRIX  OF  X,  Y... 


.8000 


.8944 


1.0000 


J|;^;3SERVATIONS.  .  . 

DEPENDENT  VARIABLE  LAST 
^IRST  3  OBSNS  PRINTED) 
!^SN  NO  1: 

-3.6655  -4.1741 

OBSN  mo  2 : 

'K-  -2.7437  -2.1663 

‘fesN  NO  3 : 

2.6782  1.5916 


?OGRAM  MREG 

JLTIPLE  LINEAR  REGRESSION  ANALYSIS 
^TA  FILE  NAME:  DFILl 

)  OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
J  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION... 
100  2  2  1 
IDICES  OF  INDEP  VARS  IN  REGRESSION... 

1 


^TA.  .  . 

(FIRST  3  OBSNS) 

V  -3.6655  -4.1741 

K  -2.7437  -2.1663 

2.6782  1.5916 

I'-ilANS  FOR  ALL  VARIABLES... 

.•>  .247  3  1.2127 

r-.7ANDARD  DEVIATIONS  FOR  ALL  VARIABLES... 
^  1.8844  2.2566 

CORRELATION  MATRIX... 

1.0000 

.8691  1.0000 


ilTERMINANT  =  .lOOOE+01 


INVERSE  OF  THE  PART  OF  THE  CORR  MATRIX  USED  IN  REGRESSION... 

1.0000 

PRODUCT  OF  MATRIX  AND  INVERSE... 

fOULD  BE  IDENTITY  MATRIX.  IF  NOT,  ROUNDOFF  ERRORS 
VE  RUINED  SOLUTION...  DISREGARD  REST  OF  ANALYSIS. 

1.0000 

VWERSE  OF  THE  PART  OF  THE  CROSS  PROD  MATRIX  USED  IN  REGRESSION... 
A  .2845E-02 


fDEX  OF  DEP  VAR  IN  REGRESSION  = 

DICES  OF  INDEP  VARS  IN  REGRESSION... 

^GRESSION  COEFFICIENTS.  .  . 
jg  1.0408 
■P-VALUES.  .  . 

17.3963 

.‘X'D  DEVS  OF  REG  COEFS... 

S  .0598 

INTERCEPT  =  .9553 

f  SI  DUAL  STANDARD  ERROR  =  1.1217 

MPLE  MULTIPLE  CORRELATION  COEFFICIENT  = 
SAMPLE  COEFFICIENT  OF  DETERMINATION 
SyM  OF  SQUARES  ATTRIBUTABLE  TO  REGRESSION, 
riGREES  OF  FREEDOM  ASSOCIATED  WITH  SSAR  = 


.8691 
7554 
SSAR  = 
1.0000 


380.8064 


il-EAN  SQUARE  OF  SSAR  = 


.8064 


SUM  OF  SQUARES  OF  DEVIATIONS  FROM  REGRESSION,  SSDR  = 
«GREES  OF  FREEDOM  ASSOCIATED  WITH  SSDR  ==  98.0000 


123.3153 


&-» 


Bean  square  of  ssdr  = 
F-VALUE  =  302.6309 


1.2583 


'■> 


'V.k' 


i 


PROGRAM  ITEREG 

INTIMATES  REGRESSION  COEFFICIENTS  BY  AN  ITERATIVE  METHOD 
DATA  FILE  NAMEj  DFILl 

B)  OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
TO  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION... 
^100  2  2  1 
JiDICES  OF  INDEP  VARS  IN  REGRESSION... 

KGRESSION  METHOD  USED  AT  EACH  ITERATION:  WALD'S  METHOD 
^INT  AFTER  EVERY  1  ITERATIONS 

ijl^TA.  .  . 

#IRST  3  OBSNS) 

-3.6655  -4.1741 

^  -2.7437  -2.1663 


2.6782 


1.5916 


,p:ANS  FOR  VARIABLES  IN  REGRESSION, 
^PENDENT  VARIABLE  LAST... 

■  .2473  1.2127 


;^DIANS. .  . 
.19  38 


1.1462 


KTERQUARTILE  RANGES.. 


2.5937 


3.2969 


g^GRESSION  COEFFICIENTS,  INTERCEPT... 
1.0310  .9577 

1.0310  .9577 

_  1.0310  .9577 

5  1.0310  .9577 

1.0310  .9577 

1.0310  .9577 

5  1.0310  .9577 

5  1.0310  .9577 

1.0310  .9577 

^  1.0310  .9577 

kESIDUAL  VARIANCE  =  1.2587 

RESIDUAL  STANDARD  ERROR  =  1.1219 

/^EFFICIENT  OF  DETERMINATION  =  .7528 


PROGR/'":  ITEREG 

^TIMAi’ES  REGRESSION  COEFFICIENTS  BY  AN  ITERATIVE  METHOD 

mTA  FILE  NAME:  DFILl 

of  OBS,  no  of  VARS,  INDEX  OF  DEP  VAR 
TO  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION... 

100  2  2  1 
gNDICES  OF  INDEP  VARS  IN  REGRESSION... 

REGRESSION  METHOD  USED  AT  EACH  ITERATION;  BARTLETT'S  METHOD 


^INT  AFTER  EVERY  1  ITERATIONS 

Pata.  .  . 

^IRST  3  OBSNS) 

-3.6655  -4.1741 

L  -2.7437  -2.1663 
\  2.6782  1.5916 


fANS  FOR  VARIABLES  IN  REGRESSION, 
PENDENT  VARIABLE  LAST... 


•  .2473 

I^EDIANS .  .  . 
&  .1938 


1.2127 


1.1462 


ITERQUARTILE  RANGES... 


2.5937 


3.2969 


P,EGRESSION  COEFFICIENTS,  INTERCEPT. 


- 

1.0513 

.9527 

1.0513 

.9527 

i 

1.0513 

.9527 

1.0513 

.9527 

1.0513 

.9527 

1.0513 

.9527 

1.0513 

.9527 

1.0513 

.9527 

1.0513 

.9527 

w 

1.0513 

.9527 

■RESIDUAL  VARIANCE  =  1. 

RESIDUAL  STANDARD  ERROR  = 

Coefficient  of  determination  = 


1.2587 


1.1219 

.7528 


PROGRAM  SIMULA 

GENERATES  MULTIVARIABLE  UNIVARIATE  SAMPLE 


I 


LE  NAME;  DFILl 
MENSION  (M)  =  1 


COEFFICIENT  MATRIX  (C) . . . 

COEFFT  MATRIX  IS  IDENTITY  MATRIX 


SIGMA  (SCALING  PARAMETER  FOR  INDEP  VARIABLES) 

P)  (INTERCEPT)  =  1.0000 

PARAMETER  VECTOR  (B) . . . 

^  1.0000 

SIG  (MODEL  ERROR  STD  DEV)  =  1.0000 

Chp  OF  OBSNS  =  100 

SKIP  0  RANDOM  NOS  PRIOR  TO  SIMULATION 

&RIANCE  MATRIX  OF  X,  Y... 

■  1.0000  1.0000 

1.0000  2.0000 

i*^EFFICIENT  OF  DETERMINATION  = 


5000 


RRELATION  MATRIX  OF  X,  Y... 
1.0000  .7071 

.7071  1.0000 


f^^SERVATIONS.  .  . 

?>£ PENDENT  VARIABLE  LAST 
^IRST  3  OBSNS  PRINTED) 
fcsN  NO  1; 

^  -1.8328  -2.3413 

OBSN  NO  2: 

-1.3718 


7944 


:OGRAM  MREG 

ILTIPLE  LINEAR  REGRESSION  ANALYSIS 


TA  FILE  N7LME:  DFILl 

OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION. 
100  221 
DICES  OF  INDEP  VARS  IN  REGRESSION... 


1 


®^TA. . . 
(FIRST 
M  -1.8328 
»  -1.3718 
™  1.3391 


3  OBS NS) 
-2.3413 
-.7944 
.2524 


SiSANS  FOR  ALL  VARIABLES... 
.1236  1.0890 


.V-TANDARD  DEVIATIONS  FOR  ALL  VARIABLES... 

^  .9422  1.5113 

TORRELATION  MATRIX... 

1.0000 

.6743  1.0000 

IjiTERMINANT  =  .lOOOE+01 

INVERSE  OF  THE  PART  OF  THE  CORR  MATRIX  USED  IN  REGRESSION... 

P  1.0000 

PRODUCT  OF  MATRIX  AND  INVERSE... 

tOULD  BE  IDENTITY  MATRIX.  IF  NOT,  ROUNDOFF  ERRORS 
VE  RUINED  SOLUTION...  DISREGARD  REST  OF  ANALYSIS. 

1.0000 

Inverse  of  the  part  of  the  cross  prod  matrix  used  in  regression... 
W  .1138E-01 


^DEX  of  DEP  VAR  IN  REGRESSION  =  2 

Indices  of  indep  vars  in  regression... 

1 

Kgression  coefficients... 

1.0816 
VALUES.  .  . 

9.0392 

DEVS  OF  REG  COEFS... 

.1197 

INTERCEPT  =  .9553 

RESIDUAL  STANDARD  ERROR  =  1.1217 

B^PLE  multiple  correlation  COEFFICIENT  =  .6743 

SAMPLE  COEFFICIENT  OF  DETERMINATION  =  .4547 

OF  SQUARES  ATTRIBUTABLE  TO  REGRESSION,  SSAK 
[S^IGREES  of  freedom  ASSOCIATED  WITH  SSAR  =  1.0000 

itoN  SQUARE  OF  SSAR  =  102.8119 

SUM  OF  SQUARES  OF  DEVIATIONS  FROM  REGRESSION,  SSDR  = 
JJIGREES  of  FREEDOM  ASSOCIATED  WITH  SSDR  =  98.0000 


102.6119 

123.3141 


1.2583 


|ean  square  of  SSDR  = 
^ALUE  =  81.7065 


I 

I 


I 

I 

I 

t 

i 


K 

f 

w 

'J 


^TIMMES "^REGRESS! ON  COEFFICIENTS  BY  AN  ITERATIVE  METHOD 
DATA  FILE  NAME;  DFILl 

K)  OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
TO  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION... 

100  2  2  1 

^DICES  OF  INDEP  VARS  IN  REGRESSION... 

fGRESSION  METHOD  USED  AT  EACH  ITERATION:  WALD'S  METHOD 

'?>'RINT  after  EVERY  1  ITERATIONS 

VL 

^IRSt’  3  OBSNS) 

-1.8328  -2.3413 

S'  -1.3718  -.7944 

'§  1.3391  .2524 

WEANS  FOR  VARIABLES  IN  REGRESSION, 

IMPENDENT  VARIABLE  LAST... 

■  .1236  1.0890 


f.'EDIANS .  .  . 

.09  69 


.9399 


nterquartile  ranges, 

1.2969  1.9375 


^JSGRESSION  COEFFICIENTS,  INTERCEPT... 


1.0619 

.  9577 

1.0619 

.9577 

1.0619 

.9577 

P 

1.0619 

.9577 

,v 

1.0619 

.9577 

1.0619 

.9577 

1.0619 

.9577 

1 . 0619 

.9577 

1.0619 

.9577 

1.0619 

.9577 

RESIDUAL  VARIANCE  = 

1.2587 


RESIDUAL  STANDARD  ERROR  = 
C-iPEFFICIENT  OP  DETERiM  I  NATION  = 


1.1219 
.  4489 


;OGRAM  ITEREG 

ITIMATES  REGRESSION  COEFFICIENTS  BY  AN  ITERATIVE  METHOD 


Bata  file  name;  dfili 

BD  of  OBS,  no  of  VARS,  INDEX  OF  DEP  VAR 
Rj  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION,.. 
100  2  2  1 

gJDICES  OF  INDEP  VARS  IN  REGRESSION . . . 

^GRESSION  METHOD  USED  AT  EACH  ITERATION:  BARTLETT 
3«INT  AFTER  EVERY  1  ITERATIONS 

^TA.  .  . 

I^IRST  3  OBSNS) 

-1.8328  -2.3413 

-1.3718  -.7944 

1.3391  .2524 

•C:  • 

MEANS  FOR  VARIABLES  IN  REGRESSION, 

^PENDENT  VARIABLE  LAST... 

■  .1236  1.0890 


;sJ:dians.  . . 
.0969 


,9399 


r 


TERQUARTILE  RANGES.. 
1.2969  1.9375 


.REGRESSION  COEFFICIENTS,  INTERCEPT... 


a 

•••• 

1.1025 

.9527 

j 

1.1025 

.9527 

a 

1.1025 

.9527 

P 

i 

1.1025 

.9527 

w 

1.1025 

.9527 

1.1025 

.9527 

1.1025 

.9527 

V 

1.1025 

.9527 

1.1025 

.9527 

1 

1.1025 

.9527 

aESIDUAL  VARIANCE  = 

RESIDUAL  STANDARD  ERROR  = 

‘>2defficient  of  determination  = 


1.2587 


1.1219 

.4489 


METHOD 


i 


PROGRAM  SIMULA 

INERATES  MULTIVARIABLE  UNIVARIATE  SAMPLE 


f 


LE  NAME:  DFILI 
MENSION  (M)  =  1 


COEFFICIENT  MATRIX  (C)... 
kH  COEFFT  MATRIX  IS  IDENTITY  MATRIX 


GMA  (SCALING  PARAMETER  FOR  INDEP  VARIABLES) 
(INTERCEPT)  =  1.0000 


PARAMETER  VECTOR  (B) 

1.0000 


1.0000 


SIG  (MODEL  ERROR  STD  DEV)  = 

% 

OF  OBSNS  =  100 

bICIP  0  RANDOM  NOS  PRIOR  TO  SIMULATION 

S^RIANCE  MATRIX  OF  X,  Y. . . 

"  .0400 


.0400 


.0400 

1.0400 


..1 


fPEFFICIENT  OF  DETERMINATION 


.0385 


fRRELATION  MATRIX  OF  X,  Y... 
1.0000  .1961 

.1961  1.0000 

tieSERVATIONS.  .  . 

L’EPENDENT  VARIABLE  LAST 
(FIRST  3  OBSNS  PRINTED) 
j^SN  NO  1  : 

■/.  -.3666  -.87  51 

OBSN  NO  2: 

-.2744  .3030 

•^BSN  NO  3: 

.2678  -.8188 


.OGRAM  MR  EG 

ILTIPLE  LINEAR  REGRESSION  ANALYSIS 


ATA  FILE  NAME:  DFILl 


Pb  OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
IN  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION... 

m  100  221 

P^DICES  OF  INDEP  VARS  IN  REGRESSION... 

1 


^vTA.  .  . 

WIRST  3  OBSNS) 
-.3666  -.8751 

-.2744  .3030 

.2678  -.8188 


J^mNS  FOR  ALL  VARIABLES. 


.0247 


,9901 


■STANDARD  DEVIATIONS  FOR  ALL  VARIABLES... 

.1884  1.1472 

B 

CORRELATION  MATRIX . . . 

1.0000 

.  2313  1.0000 

^TERMINANT  =  .lOOOE+01 

DIVERSE  OF  THE  PART  OP  THE  CORR  MATRIX  USED  IN  REGRESSION... 

r-  1.0000 

& 

Product  of  matrix  and  inverse... 

SHOULD  BE  IDENTITY  MATRIX.  IF  MOT,  ROUNDOFF  ERRORS 
■^VE  RUINED  SOLUTION...  DISREGARD  REST  OF  ANALYSIS. 

V  1.0000 

^^.-NVERSE  OF  THE  PART  OF  THE  CROSS  PROD  MATRIX  USED  IN  REGRESSION... 
»  .2845E+00 


I^DEX  OF  DEP  VAR  IN  REGRESSION  =  2 

•  nWDICES  OF  INDEP  VARS  IN  REGRESSION... 

;«  1 

J  REGRESSION  COEFFICIENTS... 

1.4080 

«■  ^VALUES... 

2.3535 

;:td  DEVS  of  reg  coefs... 

•}:  .598  3 

fNTERCEPT  =  .9553 

RBSIDUAL  STANDARD  ERROR  =  1.12] 7 

^MPLE  MULTIPLE  CORRELATION  COEFFICIENT  =  .2313 

WLMPLE  COEFFICEENT  OF  DETERMIMATIOM  .05  35 

,  SUM  OF  SQUARES  ATTRIBUTABLE  TO  HF.GR'-.SSION,  SS.\R  -- 
1  ;-^GREES  OP  FREEDOM  ASSOCIATED  Wl'iil  R.SaR  ==  1.00  0  0 

I  IL'^AN  SQUARE  OF  SSAR  =  6.96  98 

i  SUM  OF  SQUARES  OF  DEVIATIONS  FROM  REGRESSION,  SSDR.  r 
i  BEGREES  OF  FREEDOM  ASSOCIATED  WITrl  SSD:3  ==  98.0000 


6.9698 


123.3141 


Cil  IHPROVED  flLeORITHHS  FOR  ESTIIMTIOH  PREDICTION  RHD 

CONTROL(U>  VISTR  RESEARCH  CORP  TUSCON  RZ  J  Q  CRLDHELL 
26  JRN  96  N00014-8S-C-0S14 


UNCLASSIFIED 


F/a  9/2 


wtCRocor 


PROGRAM  ITEREG 

■STIMATES  REGRESSION  COEFFICIENTS  BY  AN  ITERATIVE  METHOD 


DATA  FILE  NAME;  DFILl 


fO  OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
N  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION... 
^100  2  2  1 
JNDICES  OF  INDEP  VARS  IN  REGRESSION... 


GRESSION  METHOD  USED  AT  EACH  ITERATION:  WALD'S  METHOD 


RINT  AFTER  EVERY  1  ITERATIONS 


fTA.  .  . 

IRST  3  OBS NS) 
-.3666  -.8751 

»  -.2744  .3030 

•J;  .2678  -.8188 


flANS  FOR  VARIABLES  IN  REGRESSION, 
PENDENT  VARIABLE  LAST... 

.0247  .9901 


•JeDI  ANS ... 
.0194 


.9687 


l£NTERQUARTILE  RANGES... 


.2594 


1.5406 


REGRESSION  COEFFICIENTS,  INTERCEPT... 
1.3097  .9  577 


j*;;  1.3097 

1.3097 


^  J- .  J 

^^esidua 


1.3097 

1.3097 

1.3097 

1.3097 

1.3097 

1.3097 

1.3097 

1.3097 


.9577 

,9577 

,9577 

,9577 

,9577 

,9577 

,9577 

.9577 

9577 


VCESIDUAL  VARIANCE  =  1. 

RESIDUAL  STANDARD  ERROR  = 
^EFFICIENT  OF  DETERMINATION  = 


1.2587 


1.1219 

.0436 


I 


;:vVk-0,V\» 


*;i 


lOGRAM  ITEREG 

ITIMATES  REGRESSION  COEFFICIENTS  BY  AN  ITERATIVE  METHOD 


fATA  FILE  NAME;  DFILl 

Ip  OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
ll  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION... 
100  2  2  1 
WDICES  OF  INDEP  VARS  IN  REGRESSION... 


^GRESSION  METHOD  USED  AT  EACH  ITERATION;  BARTLETT'S  METHOD 
ffeiNT  AFTER  EVERY  1  ITERATIONS 
g^TA. . . 

fVFIRST  3  OBSNS) 

-.3666  -.8751 

-.2744  .3030 

.2678  -.8188 


fCANS  FOR  VARIABLES  IN  REGRESSION, 
:PENDENT  VARIABLE  LAST... 

.0247  .9901 


‘^DIANS.  .  . 
.0194 


.9687 


JtoNTERQUARTILE  RANGES. . . 

I  .2594  1.5406 

REGRESSION  COEFFICIENTS,  INTERCEPT. 


1.5126 

.9527 

1.5126 

.9527 

w 

1 . 5126 

.9527 

1.5126 

.9527 

•  '* 

1.5126 

.9527 

1.5126 

.9527 

1.5126 

.9527 

1.5126 

.9527 

1.5126 

.9527 

m 

1.5126 

.9527 

V^ESIDUAL  VARIANCE  = 

RESIDUAL  STANDARD  ERROR  = 
^EFFICIENT  OF  DETERMINATION  = 


1.2587 


1.1219 

.0435 


PROGRAM  SIMULA 

INERATES  MULTIVARIABLE  UNIVARIATE  SAMPLE 


C 


LE  NAME;  DFILl 


JtMENSION  (M)  =  3 

COEFFICIENT  MATRIX  (C)... 

^  COEPFT  MATRIX  IS  IDENTITY  MATRIX 

SIGMA  (SCALI^G  PARAMETER  FOR  INDEP  VARIABLES) 


(INTERCEPT)  =  1.0000 


PARAMETER  VECTOR  (B) . . . 

1.0000  -1.0000  1.0000 

SIG  (MODEL  ERROR  STD  DEV)  =  1.0000 

OF  OBSNS  =  100 

SKIP  0  RANDOM  NOS  PRIOR  TO  SIMULATION 


RIANCE  MATRIX  OF  X 

100.0000  0.0000 

0.0000  100.0000 
0.0000  0.0000 

•y  100.0000  -100.0000 


Y.  .  . 

0.0000  100.0000 
0.0000  -100.0000 
100.0000  100.0000 
100.0000  301.0000 


)EFFICIENT  OF  DETERMINATION  = 


CORRELATION  MATRIX  OF  X,  Y. . . 


1.0000 

0.0000 

0.0000 

.5764 


0.0000 

1.0000 

0.0000 

-.5764 


-BSERVATIOMS .  .  . 


0.0000 

0.0000 

1.0000 

.5764 


DEPENDENT  VARIABLE  LAST 
■CFIRST  3  OBSNS  PRINTED) 

:5bSN  no  1 ; 


-18.3276 


rSN  NO 
13.3911 
DBSN  NO 


-14.8901 


-15.0855 

2; 

-20.8667 


-13.7183 

-6.9995 


.9967 


.5764 

-.5764 

.5764 

1.0000 


-16.3831 

28.7576 


3.3520  -20.2808  -37.1018 


I 

i 

I 


OGRAM  MREG 

LTIPLE  LINEAR  REGRESSION  ANALYSIS 
TA  FILE  NAME:  DFILl 


OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
IN  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION. 
■  100  4  4  3 

BJDICES  OF  INDEP  VARS  IN  REGRESSION... 
12  3 


f 


^TA. . . 

flRST  3  OBSNS ) 

-18.3276  -15.0855  -13.7183 

13.3911  -20.8667  -6,9995 

-14.8901  3.3520  -20.2808 

SANS  FOR  ALL  VARIABLES... 

.0505  -1.4574  .7599 


-16.3831 
28.7576 
-37  .1018 


3.1979 


SyTANDARD  DEVIATIONS  FOR  ALL  VARIABLES... 
9.2663  10.3026  8.5514  16.2291 


1.0000 

,6490 


1.0000 


[  CORRELATION  MATRIX... 

{ y-i  1.0000 

{  .1866  1.0000 

^  .1537  -.0497 

.5326  -.5513 

DETERMINANT  =  .9362E+00 

It 

j  ^WVERSE  OF  THE  PART  OF  THE  CORR  MATRIX  USED  IN  REGRESSION, 
I  ’■*  1.0655 

-.2075  1.0429 

P  -.1741  .0837  1.0309 


ts," 


^  PRODUCT  OF  MATRIX  AND  INVERSE... 

■  v'iOULD  BE  IDENTITY  MATRIX.  IF  NOT,  ROUNDOFF  ERRORS 
J  3VE  RUINED  SOLUTION...  DISREGARD  REST  OF  ANALYSIS. 


1.0000 
J-  .0000 

^  -.0000 


1^' 


1.0000 

-.000* 


1.0000 


INVERSE  OF  THE  PART  OF  THE  CROSS  PROD  MATRIX  USED  IN  REGRESSION. 
J  K  .1253E-03 
I  »  -.2195E-04  .9924E-04 

!  -.2219E-04  .9602E-05  .1424E-03 

)  J^'iDEX  OF  DEP  VAR  IN  REGRESSION  4 

INDICES  OF  INDEP  VARS  IN  REGRESSION... 

^12  3 

jjfeGRESSION  COEFFICIENTS.  .  . 

^  .9964  -.9942  1.0063 

T-VALUES. . . 

^  77.3142  -86.6936  73.2542 

•N:rD  DEVS  OF  REG  COEFS... 

.0129  .0115  .0137 

^TERCEPT  =  .9340 


r 


i 


0 


& 


SSI  DUAL  STANDARD  ERROR  =  1.1511 

MPLE  MULTIPLE  CORRELATION  COEFFICIENT  =  .9976 

EPLE  coefficient  of  determination  =  .9951 

OF  SQUARES  ATTRIBUTABLE  TO  REGRESSION,  SSAR  = 
REES  OF  FREEDOM  ASSOCIATED  WITH  SSAR  =  3.0000 

MEAN  SQUARE  OF  SSAR  =  8649.2520 

■UM  OF  SQUARES  OF  DEVIATIONS  FROM  REGRESSION,  SSDR  = 
KgREES  of  FREEDOM  ASSOCIATED  WITH  SSDR  =  96.0000 

MEAN  SQUARE  OF  SSDR  =  1.3251 

S5-VALUE  =  6527.0757 


25947.7559 


127.2129 


I 


;OGRAM  ITEREG 

TIMATES  REGRESSION  COEFFICIENTS  BY  AN  ITERATIVE  METHOD 


DATA  PILE  NAME:  DFILl 


I 


OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
IN  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION... 
100  4  4  3 

DICES  OF  INDEP  VARS  IN  REGRESSION... 
12  3 


GRESSION  METHOD  USED  AT  EACH  ITERATION:  Vv'ALD '  S  METHOD 


1  ITERATIONS 


PRINT  AFTER  EVERY 
^lATA.  .  . 

(FIRST  3  OBSNS) 

.p -18.3276  -15.0855  -13.7183  -16.3831 

'/  13.3911  -20.8667  -6.9995  28.7576 

■  -14.8901  3.3520  -20.2808  -37.1018 


jiEANS  FOR  VARIABLES  IN  REGRESSION, 
roPENDENT  VARIABLE  LAST... 


.0505 


-1.4574 


.7599 


3.1979 


'•■EDI  AN  S. 


-.0464 

«  • 

-1.8042 

.5005  1 

.8826 

INTERQUARTILE  RANGES... 
13.2813  14.8438 

12.0312  23 

.7656 

.'ilEGRESSION 

.8977 

COEFFICIENTS 

,  INTERCEPT. 

♦  • 

-.9887 

1.0277 

.9307 

.9905 

-.9949 

1.0086 

.9316 

H  .9952 

-.9959 

1.0077 

.9306 

>:•  .9956 

-.9959 

1.0076 

.9305 

.9956 

-.9959 

1.0076 

.9305 

•V  .9955 

-.9959 

1.0076 

.9305 

.9956 

-.9959 

1.0076 

.9305 

.9956 

-.9959 

1.0076 

.9305 

—  .9956 

-.9959 

1.0076 

.9305 

.99  56 

-.9959 

1.0076 

.9305 

RESIDUAL  VARIANCE  = 
It^SIDUAL  STANDARD  ERROR 

1.3257 
=  1 

.1514 

itoEFFICIENT 

OF  DETERMINATION  = 

.9950 

V 


PROGRAM  SIMULA 

GENERATES  MULTIVARIABLE  UNIVARIATE  SAMPLE 


[LE  NAME:  DPI LI 


JtMENSION  (M)  »=  3 

COEFFICIENT  MATRIX  (C) . . . 

f  1.0000  0.0000  0.0000 

.1000  1.0000  .1000 

.5000  .5000  1.0000 

^GMA  (SCALING  PARAMETER  FOR  INDEP  VARIABLES) 


BO  (INTERCEPT)  =  1.0000 


JiARAMETER  VECTOR  (B)  . .  . 

1.0000  -1.0000  1.0000 

V 

;'1g  (MODEL  ERROR  STD  DEV)  =  1.0000 

liO  OF  OBSNS  =  100 

C^klP  0  RANDOM  NOS  PRIOR  TO  SIMULATION 

ik 

VARIANCE  MATRIX  OF  X,  Y. . . 

>'100.0000  10.0000  50.0000  140.0000 

A-  10.0000  102.0000  65.0000  -27.0000 

50.0000  65.0000  150.0000  135.0000 


^140.0000  -27.0000  135.0000  303.0000 

COEFFICIENT  OF  DETERMINATION  =  .9967 


f.;bRRELATION  MATRIX  OF  X,  Y... 

■““*  1.0000  .0990  .4082  .8043 

.0990  1.0000  .5255  -.1536 

V  .4082  .5255  1.0000  .6332 

M  .8043  -.1536  .6332  1.0000 


^•SSERVATIONS.  .  . 

^JePENDENT  VARIABLE  LAST 
IfIRST  3  OBSNS  PRINTED) 

^SN  NO  1 : 

:•*  -18.3276  -18.2900  -30.4248  -29.8850 

OBS  N  >jO  2  : 

13.3911  -20.2275  -10.7373  24.3806 

i-I^BSN  NO  3: 

2k  -14.8901 


1650  -26.0498  -39.3538 


:OGRAM  MREG 

ILTIPLE  LINEAR  REGRESSION  ANALYSIS 


^TA  FILE  NAME:  DPI LI 


OF  OBS,  NO  OF  VARS 
IN  REGRESSION,  NO  OF 
tfi  100  4 


,  INDEX  OF  DEP  VAR 
INDEP  VARS  IN  REGRESSION... 
4  3 


Indices  of  indep  vars  in  regression... 
12  3 


l?^TA.  .  . 

fPlRST  3  OBSNS) 

^-18.3276  -18.2900  -30.4248 

K  13.3911  -20.2275  -10.7373 

*•'-14.8901  -.1650  -26.0498 


-29.8850 

24.3806 

-39.3538 


FOR  ALL  variables... 

.0505  -1.3763  .0565  2.4134 


S^PANDARD  DEVIATIONS  FOR  ALL  VARIABLES... 
^  9.2663  10.5197  11.7402  16.2744 

CORRELATION  MATRIX... 

o'  1.0000 

.2833  1.0000 

.5885  .5806  1.0000 

j  .8099  -.0630  .6842  1.0000 

DETERMINANT  =  .4300E+00 


i^JVERSE  OF  THE  PART  OF  THE  COI'R  MATRIX  USED  IN  REGRESSION... 
1.5418 

.1356  1.5204 

-.9860  -.9625  2.1390 

PRODUCT  OF  MATRIX  AND  INVERSE... 

•^I'iOULD  BE  IDENTITY  MATRIX.  IF  NOT,  ROUTUDOFF  ERIxORS 
.'AVE  RUINED  SOLUTION...  DISREGARD  REST  OF  ANALYSIS. 

1.0000 

^  .0000  1.0000 

.0000  .000*  1.0000 


inverse  of  THE  PART  OF  THE  CROSS  PROD  MATRIX  USED  IN  REGRESSION... 
^  .1814E-03 

a  .1405E-04  .1388E-03 

-.9155E-04  -.7872E-04  .1560E-03 


SAdlx  of 

INDICES 


^EGRESSION 
■  .9931 

T-VALUES. . . 

64.0  471 
J-TD  DEVS  OF 


DEP  VAR  IN  REGRESSION 
OF  INDEP  VARS  IN  REGRESSION. 
12  3 

COEFFICIENIS.  .  . 

-.9972  l.OOGO 


-73.5204  69.7856 

REG  COEFS . . . 


I 


.0155 
i^TERCEPT  = 


.0136  .0144 

.9341 


4 


tSIDUAL  STANDARD  ERROR  =  1.1514 

MPLE  MULTIPLE  CORRELATION  COEFFICIENT  =  .9976 

4PLE  COEFFICIENT  OF  DETERMINATION  =  .9951 

m  OF  SQUARES  ATTRIBUTABLE  TO  REGRESSION,  SSAR  = 
:GREES  of  freedom  associated  with  SSAR  =  3.0000 

MEAN  SQUARE  OF  SSAR  =  8697.8818 

Km  of  squares  of  deviations  from  regression,  ssdr  = 

GREES  OF  FREEDOM  ASSOCIATED  WITH  SSDR  =  96.0000 

MEAN  SQUARE  OF  SSDR  =  1.3257 

'VALUE  =  6561.1553 


r 

t  S 


26093.6445 

127.2637 


lOGRAM  I  TER  EG 

.TIMATES  REGRESSION  COEFFICIENTS  BY  AN  ITERATIVE  METHOD 


I 

i 

JiATA  FILE  NAME:  DFILl 


TO  OF  OBS, 

NO  OF  VARS, 

INDEX  OF 

DEP  VAR 

IN  REGRESSION,  NO  OF  INDEP  VARS 

IN  REGRESSION. 

m  100 

4 

4 

3 

^DICES  OF 

INDEP  VARS 

IN  REGRESSION... 

1 

2 

3 

W 

g-EGRESSION 

METHOD  USED 

AT  EACH  ITERATION:  WALD 

RRINT  AFTER 

EVERY  1 

ITERATIONS 

•  • 

^ATA. . . 

(FIRST  3 

OBSNS) 

-18.3276 

-18.2900 

-30.4248 

-29.8850 

v:  13.3911 

-20.2275 

-10.7373 

24.3806 

-14.8901 

-.1650 

-26.0498 

-39.3538 

teANS  FOR  VARIABLES  IN 

REGRESSION, 

roPENDENT  VARIABLE  LAST... 

.0505 

-1.3763 

.0565 

2.4134 

•VCDIAMS  . 


-.0464 

-1.3525 

.2002 

2.7556 

1 

iTERQUARTILE  RANGES... 

13.2813 

15.3125 

13.7500 

25.5781 

.^EGRESSION 

COEFFICIENTS 

,  INTERCEPT. . . 

•  • 

1.4114 

-.4568 

.  5350 

1.6833 

P 

1.1608 

-.7393 

.7908 

1.2927 

1.0651 

-.8805 

.9055 

1.0967 

1 .  o:  5 

-.9453 

.9553 

1.0067 

1.0126 

-.9737 

.9765 

.9671 

^  • 

1.0066 

-.9859 

.  9854 

.9501 

1.0041 

-.9910 

.9891 

.9430 

1.0031 

-.9932 

.9907 

.9400 

1.0027 

-.9941 

.9913 

.9387 

k'*. 

1.0025 

-.9945 

.9916 

.9382 

j  RESIDUAL  VARIANCE  =  1.3410 

[  v5:sidual  standard  error  = 

\  ^EFFICIENT  OF  DETERMINATION  = 


METHOD 


1.1580 

.9949 


PROGRAM  SIMULA 

Knerates  multivariable  univariate  sample 

LE  NAME;  DFILl 


H^MENSION  (M)  =  3 

COEFFICIENT  MATRIX  (C) 

tjv  1.0000  0.0000 

K  .5000  1.0000 

.9000  .9000 


0.0000 

.5000 

1.0000 


plGMA  (SCALING  PARAMETER  FOR  INDEP  VARIABLES) 


,_^0  (INTERCEPT)  =  1.0000 

S' 

!;C>ARAMETER  VECTOR  (B)  .  .  . 

1.0000  -1.0000  1.0000 

>TG  (MODEL  ERROR  STD  DEV)  =  1.0000 


yo  OF  OBSNS  =  100 

0  RANDOM  NOS  PRIOR  TO  SIMULATION 


VARIANCE  MATRIX  OF  X,  Y. 
•  *  ' 


;•> 

100.0000 

50.0000 

90.0000 

140.0000 

•  • 

50.0000 

150.0000 

185.0000 

85.0000 

90.0000 

185.0000 

262.0000 

167.0000 

140.0000 

85.0000 

167.0000 

223.0000 

COEFFICI ENT 

OF  DETERMINATION  = 

.9955 

V  . 

Correlation 

MATRIX  OF 

X,  Y. . . 

1.0000 

.  4082 

.  5560 

.9375 

.  4082 

1.0000 

.9332 

.  4648 

.  5560 

.  9332 

1.0000 

.6909 

•j 

.  9375 

.  4648 

.  6909 

1.0000 

./ESERVATICNS.  .  . 

;-;epemuent  variadls  last 

(FIRST  3  OnSNS  PRINTED) 
^SN  NO  1  : 


-18.327  6 
OfiSN  NO 
13.3911 
.‘.'•BSN  NO 
3i5  -14.8901 


-31.1084 

2: 

-17.6709 

3: 

-14 . 2334 


-43.7901 
-13. 7276 


-30.4319 

18.8337 


S' 


-30.6651  -29.9006 


IROGRAM  MREG 

ULTIPLE  LINEAR  REGRESSION  ANALYSIS 


DATA  FILE  NAME:  DFILl 


lb 


OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
IN  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION. 
100  4  4  3 

DICES  OF  INDEP  VARS  IN  REGRESSION... 
12  3 


fR^TA ... 

WIRST  3 
-18.3276 
13.3911 
vi  -14.8901 


OBSNS) 

-31.1084 

-17.6709 

-14.2334 


-43.7901  -30.4319 
-13.7276  18.8337 
-30.6651  -29.9006 


>EANS  FOR  ALL  VARIABLES... 

-■I:  .0505  -1.0521  -.5064  1.5265 


STANDARD  DEVIATIONS  FOR  ALL  VARIABLES... 
9.2663  12.8614  16.4787  14.1898 


CORRELATION  MATRIX _ 


1.0000 

.5608 

.6908 

.9460 

TERr>lINANT 


1.0000 

.9437  1.0000 

.5593  .7603 

.1605E+00 


1.0000 


•I^'^VERSE  OF  THE  PART  OF  THE  CORR  MATRIX  USED  IN  REGRESSION... 
1.69  62 

-.0010  4.2709 

H  .0010  -3.4656  3.4656 

PRODUCT  OF  MATRIX  AND  INVERSE. . . 

'XiOULD  EE  IDENTITY  MATRIX.  IF  NOT,  ROU^]DOFF  ERRORS 
•■■:AVE  ruined  SOLUTION...  DISREGARD  REST  OF  ANALYSIS. 

1.6964 

^  .9512  1.0000 

,•>  1.1719  .5640  -.5004 


INA/ERSE  OF  THE  PART  OF  THE  CROSS  PROD  MATRIX  USED  IN  REGRESSION. 
>:  .1995E-0  3 

a  -.8687E-07  .2608E-03 

.67&0E-07  ■-.1652E--03  .1289E-03 


iCviDEX  OF  DEP  VAR  IN  REGRESSION  ^ 
INDICES  OF  INDEP  VARS  IN  REGRESSION... 


GRESSION 
1 . 2840 


2 

COEFFICIENTS. 

-.2726 


3 

G007 


T-VALUES. . . 
»  14.6515 

fl^D  DEVS  OF 
.0876 


j'JTERCEPT  = 


-2.7213 
REG  COEFS 
.1002 
1.4790 


8 . 5276 
.0704 


4 


lUUIUMMLmvU 


'HsiDUAL  STANDARD  ERROR  =  6.2037 

SAMPLE  MULTIPLE  CORRELATION  COEFFICIENT  =  1.0887 

PLE  COEFFICIENT  OF  DETERMINATION  =  1.1853 

OF  SQUARES  ATTRIBUTABLE  TO  REGRESSION,  SSAR  = 
DEGREES  OP  FREEDOM  ASSOCIATED  WITH  SSAR  =  3.0000 

SQUARE  OF  SSAR  =  7876.1304 

OF  SQUARES  OF  DEVIATIONS  FROM  REGRESSION,  SSDR  = 
GREES  OF  FREEDOM  ASSOCIATED  WITH  SSDR  =  96.0000 

MEAN  SQUARE  OF  SSDR  =  -38.4864 

Rvalue  =  -20  4.6469 


23628.3906 


-3694.6973 


s'* 


ii 


8 


I 


Jim' 


I 


St- 


i 


j^ROGRAM  I TER EG 

ESTIMATES  REGRESSION  COEFFICIENTS  BY  AN  ITERATIVE  METHOD 
DATA  FILE  NAME:  DFILl 

(o  OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
IN  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION... 

100  4  4  3 

INDICES  OF  INDEP  VARS  IN  REGRESSION... 

12  3 


EGRESSION  METHOD  USED  AT  EACH  ITERATION;  WALD'S  METHOD 


PRINT  AFTER  EVERY  10  ITERATIONS 
(IlATA.  .  . 

(FIRST  3  OBSNS) 

-18.3276  -31.1084  -43.7901  -30.431.9 

13.3911  -17.6709  -13.7276  18.8337 

‘  -14.8901  -14.2334  -30.6651  -29.9006 

j^EANS  FOR  VARIABLES  IN  REGRESSION, 
ifePENDENT  VARIABLE  LAST... 

.0505  -1.0521  -.5064  1.5265 

C^DIANS.  .  . 

-.0464  -.7959  .4599  2.5525 

jt^TERQUARTILE  RANGES.  .  . 

13.2813  19.3750  22.0625  22.4375 


GRESSION  COEFFICIENTS,  INTERCEPT... 


1.1720 

-.4336 

.5080 

1.2684 

1.0705 

-.7462 

.7845 

1.0846 

1.0178 

-.9088 

.9282 

.9890 

.9903 

-.9933 

1.0030 

.9392 

.9761 

-1.0373 

1.0418 

.9134 

.9686 

-1.0602 

1.0621 

.9000 

.  9648 

-1.0720 

1.0726 

.8930 

.9628 

-1.0782 

1.0780 

.8893 

.9617 

-1.0814 

1.0809 

.8874 

.9612 

-1.0831 

1.0823 

.8865 

RESIDUAL  VARIANCE  = 


1.4433 


i»;«SIDUAL  STANDARD  ERROR  = 
JODEFFICIENT  OF  DETERMINATION  = 


1.2014 

.9928 


I 

PROGRAM  SIMULA 

GENERATES  MULTIVARIABLE  UNIVARIATE  SAMPLE 
^ILE  NAME:  DFILl 


JtMENSION  (M)  =  3 

COEFFICIENT  MATRIX  (C) . . . 

a  1.0000  0.0000  0.0000 

0.0000  1.0000  0.0000 

1.0000  1.0000  0.0000 


GMA  (SCALING  PARAMETER  FOR  INDEP  VARIABLES) 


BO  (INTERCEPT)  =  1.0000 


^RAMETER  VECTOR  (B)  . .  . 

1.0000  -1.0000  1.0000 


^Jg  (MODEL  ERROR  STD  DEV) 


1.0000 


NO  OF  OBSNS  =  100 

^<IP  0  RANDOM  NOS  PRIOR  TO  SIMULATION 


VARIANCE  MATRIX  OF  X,  Y. . . 


100.0000 

0.0000 

100.0000 

200.0000 

EFFICI ENT 


0.0000 

100.0000 

100.0000 

0.0000 


100.0000 

100.0000 

200.0000 

200.0000 


OF  DETERMINATION 


200.0000 

0.0000 

200.0000 

401.0000 

.9975 


{^RR  ELATION 


•A 


I 


1.0000 

0.0000 

.7071 

.9988 


MATRIX  OF  X,  Y... 
0.0000  .7071 

1.0000  .7071 

.7071  1.0000 

0.0000  .7062 


.9988 

0.0000 

.7062 

1.0000 


OBSERVATIONS. . . 

*‘;JePENDENT  VARIABLE  LAST 
'iTIRST  3  OBSNS  PRINTED) 

OBSN  NO  1: 

5-18.3276  -15.0855  -33.4131 

iisSN  NO  2: 

13.3911  -20.8667  -7.4756 


rSN  NO  3 

-14.8901 


-36.0779 

28.2815 


10.0000 


I 


3.3520  -11.5381  -28.3591 


I 


OGRAM  MREG 

ULTIPLE  LINEAR  REGRESSION  ANALYSIS 


C 


ATA  FILE  NAME;  DFILl 


OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
IN  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION... 
[@  100  4  4  3 

WNDICES  OF  INDEP  VARS  IN  REGRESSION... 
12  3 


^TA.  .  . 

\FIRST  3  OBS NS) 

-18.3276  -15.0855  -33.4131 

13.3911  -20.8667  -7.4756 


-36.0779 

28.2815 


5.^^  -14.8901 


3.3520  -11.5381  -28.3591 


■iJEANS  FOR  ALL  VARIABLES... 

.0505  -1.4574  -1.4069 


1.0312 


STANDARD  DEVIATIONS  FOR  ALL  VARIABLES... 
9.2663  10.3026  15.0878  18.5535 

II 

CORRELATION  MATRIX. . . 

-Y  1.0000 

.1866  1.0000 

.7416  .7975  1.0000 

.9981  .1892  .7422 


I 


1.0000 


TERI«1INANT  =  .1582E-05 


^'^JVERSE  OF  THE  PART  OF  THE  CORR  MATRIX  USED  IN  REGRESSION... 
^3011935*9 

25585351*528446825*0 
11^46875*004165908*43610080  56*2 


DISREGARD  REST  OF  ANALYSIS, 


PRODUCT  OF  MATRIX  AND  IN’VERSE.  .  . 

<^OULD  BE  IDENTITY  MATRIX.  IF  NOT,  ROUNDOFF  ERRORS 

^VE  RUINED  SOLUTION.  - - -  - 

1.0312 

1.0000 

.0313  .9375 


^  .0313 

.0313 


INVERSE  OF  THE  PART  OF  THE  CROSS  PROD  MATRIX  USED  IN  REGRESSION 
&  .2707E+02 

£  .2707E+02  .2707E+02 

-.2707E+02  -.2707E+02  .2707E+02 

‘^-IDEX  OF  DEP  VAR  IN  REGRESSION  =  4 

Indices  of  indep  vars  in  regression... 

«  1  2  3 

jtGRESSION  COEFFICIENTS. . . 

■  2.1900  .1688  -.1921 

T-VALUES. . . 

.  2764  .0213  -.0242 

Sj'D  DEVS  OF  REG  COEFS.  .  . 

7.9237  7.9237  7.9237 

UJTERCEPT  =  .8963 


i  SI DUAL  STANDARD  ERROR  =1.5229 

MPLE  MULTIPLE  CORRELATION  COEFFICIENT  =  .9967 

lMPLE  COEFFICIENT  OF  DETERMINATION  =  .9935 

JM  OF  SQUARES  ATTRIBUTABLE  TO  REGRESSION,  SSAR  = 
_JGREES  OF  FREEDOM  ASSOCIATED  WITH  SSAR  =  3.0000 

MEAN  SQUARE  OF  SSAR  =  11285.4922 

Km  of  SQUARES  OF  DEVIATIONS  FROM  REGRESSION,  SSDR  - 
GREES  OF  FREEDOM  ASSOCIATED  WITH  SSDR  =  96.0000 

MEAN  SQUARE  OF  SSDR  =  2.3193 

■VALUE  =  4865.9995 


33856.4766 


222.6484 


& 


•5 

‘y. 


% 


I 


A 


s 


5; 


OGRAM  ITEREG 

ITIMATES  REGRESSION  COEFFICIENTS  BY  AN  ITERATIVE  METHOD 


DATA  FILE  NAME:  DFILl 

B)  OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
IN  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION... 

IS  100  4  4  3 

^DICES  OF  INDEP  VARS  IN  REGRESSION... 

1  2  3 

jftGRESSION  METHOD  USED  AT  EACH  ITERATION:  WALD'S  METHOD 
PRINT  AFTER  EVERY  1  ITERATIONS 


S^TA.  .  . 

(FIRST  3  OBSNS) 

V  -18.3276  -15.0855  -33.4131 

'X-  13.3911  -20.8667  -7.4756 

^  M  ^  r\  ^  t  ^ 


-14.8901 


3.3520 


-36.0779 
28.2815 
-11.5381  -28.3591 


^ANS  FOR  VARIABLES  IN  REGRESSION, 
rePENDENT  VARIABLE  LAST... 


.0505 

i::i:DI  ANS  .  .  , 
-.0464 


-1.4574 


-1.4069 


-1.8042  -2.1631 


jjTERQUARTILE  RANGES.  .  . 

13.2813  14.8438  20.9375 


1.0312 

.9846 

26.5625 


'C-SGRESSION  COEFFICIENTS,  INTERCEPT. 


1.9979 

.0036 

.0031 

.9399 

1.9931 

.0008 

.0068 

.9413 

1.9891 

-.0029 

.0106 

.9415 

1.9852 

-.0067 

.0145 

.9416 

1.9814 

-.0105 

.0183 

.9416 

1.9775 

-.0144 

.0222 

.9416 

1.9737 

-.0182 

.0260 

.9416 

1.9698 

-.0221 

.0299 

.9416 

1.9660 

-.0259 

.0337 

.9416 

1.9621 

-.0298 

.0376 

.9416 

DUAL  VARIANCE  = 

DUAL  STANDARD  ERROR 

1.3293 
=  1 

.1529 

y)EFFICIENT  OF  DETERMINATION  = 


.9961 


V 

H 


gSROGRAM  SIMULA 

Generates  multivariable  univariate  saiMple 

FILE  NAME:  DFILl 

Dimension  (m)  =  6 

COEFFICIENT  MATRIX  (C) . . . 

K  COEFFT  MATRIX  IS  IDENTI'I'Y  MATRIX 

lipiGMA  (SCALING  PARAMETER  FOR  INDEP  VARIABLES)  = 

^'O  (INTERCEPT)  =  1.0000 


rRAMETER  VECTOR  (B)  .  .  . 

1.0000  -1.0000  1.0000 

.fIG  (MODEL  ERROR  STD  DEV)  = 


1.0000  -1.0000 
1.0000 


10.0000 


1.0000 


NO  OF  OBSNS  =  100 

?.KIP  0  RANDOM  NOS  PRIOR  ' 

Variance  matrix  of  x,  y... 


TO  simulation 


100.0000  0.0000 

0.0000  100.0000 
0.0000  0.0000 

0.0000  0.0000 

0.0000  0.0000 

0.0000  0.0000 


0.0000  0. 

0.0000  0. 

0.0000  100. 
0.0000  0. 

0.0000  0. 

0.0000  0. 

100. 


0000 

0000 

0000 


0.0000 

0.0000 

0.0000 


100.0000  -100.0000 


0000  100.0000 


0000 

0000 

0000 


|.•OEFFICIENT  OF  DETERMINATION  = 


correlation  matrix  of  X,  Y... 


0.0000 
0.0000 
100.0000  ■ 

.9983 


0.0000 

0.0000 

0.0000 

0.0000 

100.0000 

o.oooc 

-100.0000 


0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

100.0000 

100.0000 


100.0000 

-100.0000 

100.0000 

100.0000 

-100.0000 

100.0000 

601.0000 


1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

.  4079 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

-.4079 

0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

.4079 

0.0000 

0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

.  4079 

0.0000 

0.0000 

0.0000 

0.0000 

1.0000 

0.0000 

-.4079 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

1.0000 

.4079 

.4079 

-.4079 

.  4079 

.4079 

-.4079 

.  4079 

1.0000 

'Observations  . . . 

DEPENDENT  VARIABLE  LAST 
^NfIRST  3  OBSNS  PRINTED) 


JlfeSN  NO 
-18.3276 
.ASSN  NO 

4.99  27 
OBSN  NO 
16.4  380 


1: 

-15.0855 

2: 

-14.8901 

3: 

-6.5698 


-13.7183 

3.3520 

8.5474 


-4.2261 
-20 .2808 
11  .7895 


13.3911 
^  .  2114 
3.1567 


-  2  0 . 8  6  6  7 
6.8281 
2.6489 


-55.14  43 
6.3284 
44.8635 


ROGRAM  MREG 

LlTIPLE  linear  regression  ^uNALYSIS 


JATA  FILE  NAME:  DFILl 

0  OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
IN  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION.. 
^100  7  7  6 

Voices  of  indep  vars  in  regression... 


1 

2 

3 

4 

5 

6 

JJata.  .  . 

t'FIRST  3 

-18.327  6 
4.99  27 
16.4380 

OBSNS) 
-15.0855  - 

-14.8901 
-6.5698 

13.7183 

3.3520 

8.5474 

-4.2261 

-20.2808 

11.7895 

13.3911 

4.2114 

3.1567 

-20.8667 

6.8286 

2.6489 

-55.1443 

6.3284 

44.8635 

<-;EANS  FOR  ALL  VARIABLES 
fv  -.0472  .4075 

-.5128 

-.9081 

-.1784 

.8763 

.1547 

gTANDAKD  DEVIATIONS  FOR 
10.2626  9.8716 

ALL  VARIABLES... 
10.8903  10.3570 

9.4954 

9.5353 

27.1463 

CORRELATION 
1.0000 
-.0039 
.2032 
^  .0771 

1  -.0281 
.2311 
...  .58  34 

MAvTRIX.  .  . 

1.0000 
-.0813 
.  1454 
-.0781 
-.0366 
-.3250 

1.0000 
-.0221 
.0677 
.1645 
.  5361 

1.0000 

.0364 

-.0104 

.3263 

1.0000 
-.1342 
-. 3464 

1.0000 
.  5620 

1.0000 

DETERMINANT  = 


,8320E+00 


AVERSE  OF  THE  PART  OF  THE  COKR  MATRIX  USED  IN  REGRESSION, 


,0972 

.0047 

,1901 

,0910 

,0170 

,2211 


,0358 
,  0698 
,1515 
0866 
,0376 


.07  47 
,0302 
0928 
,1424 


1.0317 

-.0521 

.0143 


1.0360 

.1530 


1.0966 


I-RODUCT  OP  MATRIX  AND  INVERSE... 

SHOULD  BE  IDENTITY  MATRIX.  IF  NOT,  ROUNDOFF  ERRORS 
tCAVE  RUINED  SOLUTION...  DISREGARD  REST  OF  ANALYSIS. 


1.0000 

0.0000 


:a^ve 

RUINED 

SOLUTION. . . 

DISREGARD 

REST  0 

^  1 

.0000 

.0000 

1.0000 

V 

.0000 

-.000* 

1.0000 

V 

.0000 

.000* 

-.000* 

1.0000 

.0000 

-.000* 

-.000* 

-.000* 

.0000 

-.000* 

-.000* 

.000* 

l.OQUO 


-'.VERSE  OF  TEE  FART  OF  T} 
.1052E-03 

•:  -.4657E-05  .1U74E-03 

/  -.1718E-04  .6557E-05 

-.8649E-05  -.1496E-04 

i  .1760E-05  .9337E-05 


THE  CRO.SS  PROD  MATRIX  USED  IN  RF GR CSS  E 0?s  .  .  . 


.9153E-04 
. 2700 E-0 5 
-.9063E-05 


.97 16 E-0 4 
-. 5350E-05 


.11611,-0  3 


-.2283E-04 


.4034E-05  -.1385E-04 


.1458E-05 


,1707E-04 


1218E-03 


INDEX  OF  DEP  VAR  IN  REGRESSION  = 
||nDICES  of  INDEP  VARS  IN  REGRESSION... 
®  1  2  3 

REGRESSION  COEFFICIENTS... 


11.0048 
-VALUES . . . 
98.7807 

CD  DEVS  OF 
.0102 
TERCEPT  = 


-.9907 


1.0082 


.9830 


-1.0231 


.9974 


106.2686  100.5752  -95.7703  91.1315 


.0098 


.9916 


-96.4196 
REG  CO EPS. . . 

.0103  .0095 

.9589 

f  SI  DUAL  STANDARD  ERROR  = 

MPLE  MULTIPLE  CORRELATION  COEFFICIENT  = 

‘tfrtMPLE  COEFFICIENT  OF  DETERMINATION  = 

SUM  OF  SQUARES  ATTRIBUTABLE  TO  REGRESSION, 

"^'EGREES  OF  FREEDOM  ASSOCIATED  WITH  SSAR  = 

{■^iEAN  SQUARE  OF  SSAR  =  1214  3.9961 

SUM  OF  SQUARES  OF  DEVIATIONS  FROM  REGRESSION,  SSDR  = 
•TvEGREES  OF  FREEDOM  ASSOCIATED  WITH  SSDR  =  9  3.0000 


.0107 


.9994 
.9987 
SSAR  = 
6.0000 


.0109 


72863.9766 


91.4453 


;%'EAN  SQUARE  OF  SSDR  = 
^-VALUE  =  12350.4600 


.9833 


i 


V 


4 


I  ^ •.’lV****'***’ 


jROGRAM  ITEREG 

iSTIMATES  REGRESSION  COEFFICIENTS  BY  AH  ITERATIVE  METHOD 


DATA  FILE 

NAME:  DFILl 

"o  OF  OB3, 

NO  OF  VARS, 

INDEX  OF 

DEP  VAR 

IN  REGRESSION,  NO  OF  INDEP  VARS 

IN  REGRESSION... 

7 

7 

6 

Indices  of 

INDEP  VARS  IN  REGRESSION... 

1 

2 

3 

4 

5 

6 

?^GRESSION 

METHOD  USED 

AT  EACH  ITERATION: 

WALD'S  METHOD 

1  PRINT  AFTER  EVERY  1  ITERATIONS 

r-3 

.^ATA,  .  . 

(FIRST 

3  OBSNS) 

i,*-.  -18.327  6 

-15.0855  - 

13.7183 

-4.2261 

13.3911 

-20.8667 

-55.1443 

4.9927 

-14.8901 

3.3520 

-20.2808 

4.2114 

6.8286 

6.3284 

!  16.4380 

-6.5698 

8.5474 

11.7895 

3.1567 

2.6489 

44.8635 

1  &ANS  FOR  VARIABLES  IN 

REGRESSION, 

1  ifePENDENT  VARIABLE  LAST 

•  •  « 

1  ^  -.0472 

.4075 

-.5128 

-.9081 

-.1784 

.8763 

.1547 

^'EDI  AN  S  .  .  . 

.  3833 

.4614 

-1 .1401 

-2.4292 

-.7105 

.8911 

-.9060 

mTERQUARTILE  RANGES... 

14.3750 

13.9843 

15.3906 

13.3203 

11.8750 

11.6406 

31.4258 

;  :;:egression 

COEFFICIENTS 

,  INTERCEPT... 

’  1.48  69 

-.7377 

1.0480 

.8243 

-.9862 

.8140 

.9222 

1.0289 

-.9806 

1.0338 

.9833 

-1.0185 

.9758 

.9892 

P  .9981 

-.9968 

1.0091 

.9909 

-1.0122 

.  9923 

.  9752 

1  ■-•C  1.0007 

-.9944 

1.0049 

.9898 

-1.0103 

.9926 

.9713 

;  1.0016 

-.9937 

1.0046 

.9894 

-1.0101 

.9924 

.  9708 

'  y,  1.0018 

-.9937 

1.0046 

.9894 

-1.0101 

.9923 

.  9708 

J;/  1.0018 

-.9936 

1.0046 

.9894 

-1.0101 

.9923 

.9708 

:  1.0013 

-.9936 

1.0046 

.9894 

-1.0101 

.  9923 

.9708 

1  1.0013 

-.9936 

1.0046 

.9894 

-1.0101 

.9923 

.9708 

T-  1.0018 

-.9936 

1.0046 

.9894 

-1.0101 

.9923 

.  9708 

residual  variance  == 

1.0136 

SESIDUAL  STANDARD  ERROR 

= 

1.0068 

EFFICIENT 

OF  DETEfO'IINATiC.N  = 

.9986 

y">^r;y!a‘ray;y?/yL.-7j.r  .ryv-‘y;  j^»jp.j.yjj»r^^  p  jyAjij^iP ‘?^j#^ji.-y.'.^ii; 


£ 


^OGRMl  SIMULA 

^NERATES  MULTIVARIABLE  UNIVARIATE  SAMPLE 
^LE  NAME;  DFILl 
IcMENSION  (M)  =  6 


gDEFFICIENT  MATRIX  (C) 


1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

.1000 

1.0000 

.1000 

.1000 

.1000 

.1000 

.2000 

.2000 

1.0000 

.  2000 

.2000 

.2000 

.  3000 

.  3000 

.3000 

1.0000 

.3000 

.  3000 

.4000 

.4000 

.4000 

.4000 

1.0000 

.4000 

.5000 

.  5000 

.  5000 

.5000 

.5000 

1.0000 

f. 


-IGMA  (SCALING  PARAMETER  FOR  INDEP  VARIABLES)  = 
7^1  (INTERCEPT)  =  1.0000 


.  ^ 


PARAMETER  VECTOR  (B) 

1.0000  -1.0000 


1.0000 


TG  (MODEL  ERROR  STD  DEV)  = 


1.0000 

1.0000 


-1.0000 


10.0000 


1.0000 


i;/.)  OF  OBSNS  =  100 

/.RIP  0  RANDOM  NOS  PRIOR  TO  SIMULATION 


'^RIANCE  MATRIX  OF  X,  Y. 


B 100.0000 

10.0000 

20.0000 

30.0000 

40.0000 

50.0000 

150.0000 

10.0000 

105.0000 

38.0000 

52.0000 

66.0000 

80.0000 

9.0000 

.-W  20.0000 

38.0000 

120.0000 

74.0000 

92.0000 

110.0000 

194.0000 

30.0000 

52.0000 

74.0000 

145.0000 

118.0000 

140.0000 

219.0000 

^  40.0000 

66 . 0000 

92.0000 

118.0000 

180.0000 

170.0000 

174.0000 

50.0000 

80.0000 

110.0000 

140.0000 

170.0000 

225.0000 

275.0000 

■  150.0000 

•J.; 

9.0000 

194.0000 

219.0000 

174.0000 

275.0000 

656.0001 

COEFFICIENT 

CF  DETERMINATION  = 

.9985 

-> 

•pRRELATTON 

MATRIX  OF 

X,  Y. . . 

1.0000 

.0976 

.18  26 

.  2491 

.  2981 

.  3333 

.  5cS7 

«  .0976 

1.0000 

.  3385 

.4214 

.  4801 

.  520  5 

.6343 

T  .1826 

.  3385 

1.0000 

.  5610 

.6260 

.  6694 

.651. 

.  2491 

.  4214 

.5610 

1.0000 

.7304 

.  7751 

.7101 

.  2981 

.  4801 

.6260 

.7304 

1.0000 

.8447 

.  5064 

.3333 

.5205 

.6694 

.7751 

.  8447 

1.000'J 

.7153 

S  .5857 

.0343 

.6914 

.7101 

.  5064 

.715 ;  > 

1  .  0  00  0 

5,S’,SERVA']TC:iS.  .  . 
^EPENDIRn-T  V.\KiA.iljK  L.'\£'i’ 


(FIRST  3 

MO 

fl  -18.3276 

OBSI5'  PRIN 

1: 

—  1 9 . 46  0  2 

TED) 

-22.7412 

-20.6082 

-15.4986 

-39 . 1  4^'9 

5 . 268  1 

feSN  NO 

4.9927 

, 

/C  • 

-14 . 97  9  7 

-.4756 

-18 . 9324 

-3.7876 

-4.47 ' 

.  6  3  0  4 

1:0 

3-;  16.4380 

3  : 

-2.3118 

14.0400 

19.0559 

16.2983 

19  .  329  j 

‘2  •  '!*«.•  JO 

r 

t 

f 


;v-<  v- 


.Vv\»  i 


PROGRAM  MREG 

Multiple  linear  regression  analysis 


DATA  FILE  NAME:  DFILl 


IR)  OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
IN  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION.. 
^100  7  7  6 

twDICES  OF  INDEP  VARS  IN  REGRESSION... 


^TA.  .  . 

'rFIRST  3  OBS  NS) 

-18.3276  -19.4602  -22.7412  -20.6082 

N:  4.9927  -14.9797  -.4756  -18.9324 


16.4380 


-2.3118  14.0400 


;^i£ANS  FOR  ALL  VARIABLES.. 


-.0472 


.  3305 


-.4828 


19.0559 


-.7445 


STANDARD  DEVIATIONS  FOR  ALL  VARIABLES... 
10.2626  10.1399  12.4193  13.1373 


10.2626  10.1399 

CORRELATION  MATRIX,  .  , 


5 

6 

-15.4986 

-39.8499 

-66.2681 

-3.7876 

-4.4788 

.6304 

16.2983 

19.3298 

56.9038 

-.2521 

.  2568 

-.1204 

13.6447 

16.1550 

30 . 3060 

:-4 

1.0000 

>5 

.1462 

1.0000 

.3869 

.2952 

1.0000 

.  3890 

.4894 

.  5968 

1.0000 

1 

.4330 

.4555 

.7089 

.7764 

1.0000 

.5377 

.  5005 

.7596 

.8064 

.8517 

.7104 

.1115 

.7875 

.7222 

.6205 

1.0000 

.8239 


1.0000 


i*ETER'IIis;ANT  = 


Averse  of  th 

1.4507 
.  2377 
.1210 
.1233 
"  .0397 

_  -1.1296 


.1736E-01 

E  PART  OF  THE  CORR  MATRIX  USED  IN  REGRESSION.. 


1.4447 
.  3079 
-.2822 
-.1100 
-.7634 


2.5382 
.  2253 
-.6679 
-1.7600 


3.2263 

-1.0620 

-1.7933 


4.1532 

-2.1397 


6.5947 


••FODUCT  OF  MATRIX  AND  INVERSE... 

SHOULD  DS  IDENTITY  MATRIX.  IF  NOT,  ROUNDOFF  ERRORS 

*’•*♦»•*  - .-/-XT  T-m-r/-\xi  T-iTir»rn  A\Trvr\/'r'Tt:'» 


RUINED 

SOLUTION  .  . 

.  DISREGARD 

REST  OF 

ANALYSIS. 

1.0000 

0 . 00  CO 

1.0000 

,  00  00 

.000* 

1.0000 

.  C  0  0  u 

.  000* 

.  000* 

1.0000 

.  00 UO 

.000* 

C  .  '000 

.0000 

1.0000 

.  G‘J 

.  0  0  0  * 

.  oiJO* 

.000* 

.000* 

A'F  (i'  1 

HE  PART  OF 

THE  CK  'S 

PROD  MATR 

IX  USED  IN 

140] : -H3 

3h '/'  i.-u 

.  1419E- 

03 

. 2470E-04 


,95921.-0  5  .  2470E-04  .  1662.  2 

,9236;::-l)5  -.2140E-04  .  1395E-  .1888F-03 

.2865L-05  -.8030E-05  -.  3981E-0-i  -.5985E-04 


1.0000 


2253E-0; 


V.*** 


6882E-04  -.4707E-04 


8860E-04  -.8535E-04  -.9804E-04 


2552E-03 


J-^4DEX  OF  DEP  VAR 

J-wdices  of  INDEP 

1 

W3GRFSSIOM  COEFF 
■  1.0112 
"-VALUES. . . 

86.1889  -83. 

.v??D  DEVS  OF  REG 
.0117 

INTERCEPT  = 


IN  REGRESSION  = 
VARS  IN  REGRESSIO 
2  3 


-.9825 

3.2002 
G  COEFS. 
.0118 
.9589 


3 

4 

5 

6 

1.0182 

.  9849 

-1.0278 

1.0077 

79.6708 

72.3065 

-69.0716 

63.6332 

.0128 

.0136 

.0149 

.0158 

MCSIDUAL  STANDARD  ERROR  =  .9913 

^MPLE  MULTIPLE  CORRELATION  COEFFICIENT  =  .9995 

Sample  coefficient  of  determination  =  .9990 

SUM  OF  SQUARES  ATTRIBUTABLE  TO  REGRESSION,  SSAR  = 

Agrees  of  freedom  associated  with  ssar  =  6.0000 

*CAN  SQUAr.E  of  SSAR  =  15139.2686 

SUM  OF  SQUARES  OF  DEVIATIONS  FROM  REGRESSION,  SSDR  = 
VEGREES  of  FREEDOM  ASSOCIATED  VvITH  SSDR  =  93.0000 

KEAN  SQUARE  OF  SSDR  =  .9826 

F-VALUE  =  15407.1855 


90835.6094 


91 . 3828 


m\ 
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ffltoOGRAM  ITEREG 

Sktimates  regression  coefficients  by  an  iterative  method 


Eta  file  name:  dfili 

OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 


XN  regression,  no  OF  INDEP  VARS  IN  REGRESSION... 
K  100  7  7  6 

INDICES  OF  INDEP  VARS  IN  REGRESSION... 


1 

■ 

2 

3 

4 

5 

6 

fecRESSION 

METHOD  USED 

AT  EACH 

ITERATION: 

WALD'S  METHOD 

^RINT  AFTER 

■ 

EVERY  2  ITERATIONS 

mTA.  .  . 

FIRST  3 

OBSNS) 

•f'  -18.3276 

-19.4602  - 

22.7412 

-20.6082 

-15.4986 

-39.8499 

-66.2681 

■f*'  4,9927 

-14.9797 

-.4756 

-18.9324 

-3.7876 

-4.4788 

.6304 

16.4380 

-2.3118 

14.0400 

19.0559 

16.2983 

19.3298 

56.9038 

^2 

j^ANS  FOR  VARIABLES  IN 

REGRESSION, 

DEPENDENT  VARIABLE  LAST 

•  •  « 

-C  -.047  2 

1-IeDIANS  . . . 

.3305 

-.4828 

-.7445 

-.2521 

.2568 

-.1204 

^  .3833 

-.2219 

.4932 

-2.0652 

-.4595 

-1.3928 

-.9712 

^TERQUARTILE  RANGES... 

14.3750 

14.4844 

17,7812 

19.3985 

20.6484 

21.8750 

41.3985 

.'DIGRESSION 

COEFFICIENTS 

,  INTERCEPT... 

1.4147 

-.4632 

1.4790 

.7226 

-.5361 

.  2250 

1.1584 

a  1.1234 

-.8886 

1.3225 

1.0682 

-.6632 

.4241 

1.3838 

1.0834 

-.9831 

1.1404 

1.0977 

-.6860 

.  5760 

1.3025 

1.0602 

-.9976 

1.0535 

1.0454 

-.7153 

.7003 

1.1860 

1.0376 

-.9978 

1.0154 

.9961 

-.7622 

.8017 

1.0921 

1.0189 

-.9972 

.9979 

.9649 

-.8154 

.8810 

1.0255 

1.0G54 

-.9973  • 

.  9896 

.  9483 

-.8653 

.  9404 

.  9807 

.9963 

-.9979 

.9858 

.9411 

-.9072 

.  9829 

.9518 

?  .9904 

-.9987 

.9845 

.9390 

-.9399 

1.0122 

.9338 

A  .9867 

-.9994 

.  98  4  4 

.9396 

-.9641 

1.0318 

.9231 

j?xESIDUAL  VARIANCE  = 

^SIDUAL  STANDARD  ERROR  = 
COEFFICIENT  OF  DETERMINATION 


1.3218 


1.1497 

.9986 


JlOGRAM  SIMULA 

SNERATES  MULTIVARIABLE  UISI I  VARIATE  SAMPLE 


j^LE  NAME:  DFILl 
MMENSION  (M)  =  6 

^EFFICIENT  MATRIX  (C). 


1.0000 

o.o;  00 

0.0000 

0.0000 

0.0000 

0.0000 

.1000 

1.0000 

.1000 

.1000 

.1000 

.1000 

.3000 

.3000 

1.0000 

.3000 

.3000 

.3000 

.5000 

.  5000 

.5000 

1.0000 

.5000 

.5000 

.7000 

.7000 

.7000 

.7000 

1.0000 

.7000 

.9000 

.9000 

.9000 

.9000 

.9000 

1.0000 

SIGMA  (SCALING  PARAI'lETER  FOR  INDEP  VARIABLES)  =  10.0000 

(INTERCEPT)  =  1.0000 

%s 

PARAMETER  VECTOR  (B)... 

^  1.0000  -1.0000  1.0000  1.0000  -1.0000  1.0000 

SIG  (MODEL  ERROR  STD  DEV)  =  1.0000 

I'jb  OF  OBSNS  =  100 

SKIP  0  RANDOM  NOS  PRIOR  TO  SIMULATION 


ttRIANCE  MATRIX  OF  X, 

Y.  .  . 

■lOO.OOOO 

10.0000 

30.0000 

50.0000 

70.0000 

90.0000 

190.0000 

10.0000 

105.0000 

52.0000 

80.0000 

108.0000 

136.0000 

65.0000 

30.0000 

52.0000 

145.0000 

140.0000 

184.0000 

228.0000 

307.0000 

50.0000 

80.0000 

140.0000 

225.0000 

260.0000 

320.0000 

395.0000 

70.0000 

108.0000 

184.0000 

260.0000 

345.0000 

412.0000 

473.0000 

M  90.0000 

136.0000 

228.0000 

320.0000 

412.0000 

505.0000 

595.0000 

"  190.0000 

65.0000 

307.0000 

395.0000 

473.0000 

595.0000 

950.0001 

COEFFICI ENT 

OF  DETERMINATION  = 

.9989 

&RRELATION 

MATRIX  OF 

X ,  Y  .  .  . 

1.0000 

.0976 

.  2491 

.  3333 

.  3769 

.  4005 

.6164 

??  .0976 

1.0000 

.4214 

.5205 

.5674 

.5906 

.  2058 

-  2491 

.4214 

1.0000 

.7751 

.8227 

.8426 

.8272 

.  3333 

.  5205 

.7751 

1.0000 

.9332 

.9493 

.8544 

V**  *  37  69 

.5674 

.8227 

.9332 

1.0000 

.9871 

.8262 

g  .4005 

"  .6164 

.  5906 

.8426 

.9493 

.9871 

1.0000 

.8590 

.  2058 

.8272 

.8544 

.8262 

.8590 

1.0000 

»iBSERVATTONS  .  .  . 

SfePENDENT  VARTABLP:  f.AST 
(FIRST  3  OBSNS  I-RIMTED) 

43SN  NO  1: 

£-18.3276  -19.4602  -27.2527  -31 

OBSN  NO  2: 

4.9927  -14.9797  -2.3894  -18 

NO  3: 

16.4380  -2.3118  16.7864  23 


5296 

-37.1658 

-55.0364 

-75.2203 

0335 

-9.7869 

-13.5247 

-3.4312 

9001 

26.1545 

32 . 6745 

67.9829 

i 

SWOGRAM  MREG 

0JLTIPLE  LINEAR  REGRESSION  ANALYSIS 


,TA  FILE  NAME;  DFILl 


.TO  OF  OBS, 

NO  OF  VARS, 

INDEX  OF 

DEP  VAR 

IN  REGRESSION,  NO  OF  INDEP  VARS 

IN  REGRESSION... 

100 

7 

7 

6 

INDICES  OF 

INDEP  VARS 

IN  REGRESSION... 

1 

2 

3 

4 

5 

6 

^TA .  .  . 

(FIRST  3 

OBSNS) 

-18.3276 

-19.4602 

-27.2527 

-31.5296 

-37.1658 

-55.0364 

-75.2203 

5  4.9927 

-14.9797 

-2.3894 

-18.0335 

-9.7869 

-13.5247 

-3.4312 

^  16.4380 

-2.3118 

16.7864 

23.9001 

26.1545 

32.6745 

67.9829 

^'EANS  FOR  ALL  VARIABLES... 

-.047  2 

.  3305 

-.4678 

-.6354 

-.  3074 

-.2388 

-.4367 

i-TAND.ARD  DEVIATIONS  FOR  ALL  VARIABLES... 

£  10.2626 

10.1399 

13.7158 

16.5319 

19.9104 

24.6741 

36.3758 

CORRELATION 

MATRIX.  .  . 

1.0000 

.14  62 

1.0000 

.  4448 

.3738 

1.0000 

m  .4830 

.5431 

.8001 

1.0000 

■  .5294 

.5416 

.8655 

.9471 

1.0000 

.  5623 

.  5633 

.8792 

.9577 

.9886 

1.0000 

V  .7218 

.2374 

.8849 

.8695 

.8761 

.9041 

1.0000 

DETERi'IINANT 

=  .4848E- 

03 

INVERSE  OP 

THE  PART  OF 

THE  CORR 

MATRIX  USED  IN  REGRESSION... 

1.5937 

.  4234 

1.6643 

-A  .4623 

.7992 

4.7397 

•A  .6213 

.0557 

1.4482 

11.2243 

-.2875 

-.3710 

-2.8464 

-6.8279 

23.9330 

r?  .  2875 

.3710 

2.8464 

6.8279 

-13.8066 

13.8066 

‘R'RODUCT  OF  MATRIX  AND  INVERSE.  .  . 
jiHOULD  BE  IDENTITY  MATRIX.  IF  NOT,  ROUNDOFF  ERRORS 
S^AVE  RUINED  SOLUTION...  DISREGARD  REST  OF  ANALYSIS. 

2.1708 

1.1729  2.0632 

1.8305  1.6593  6.4417 

1.9940  1.8075  5.9277  12.7853 

2.0584  1.8659  6.1191  12.1657  1.0000 

g  2.1395  1.9358  6.2965  12.4811  .4413  -9.8914 

INVERSE  OF  THE  PART  OF  THE  CROSS  PROD  MATRIX  USED  IN  REGRESSION... 
.r  .1528E-03 

'v  .4110E-04  .1635E-03 

.3318E-04  .5805E-04  .  2545E-03 

.3699E-04  .3358E-05  .6451E-04  .4148E-03 

H-.1421E-04  -.1856E-04  -.1053E-03  -.2095E-03  .6098E-03 


I 


.1147E-04  .1498E-04 


.8496E-04 


,1691E-03  -.2839E-03 


.2291E-03 


DEX  OF  DEP  VAR  IN  REGRESSION  = 
DICES  OF  INDEP  VARS  IN  REGRESSION.. 

12  3 

GRESSION  COEFFICIENTS... 


y.i 

P  1.1545 
T-VALUES . . . 
.8738 


-.8600 


1.2206 


1.2504 


.5745 


-.4873  13.4723 


-.1847 


8.3293 


2.1767 


2.6391 


1.6175 


_  -.6293  .7159 

HtD  DEVS  OF  REG  COEFS... 

■  1.3212  1.3665  1.7049 

INTERCEPT  =  4.3353 

fcSIDUAL  STANDARD  ERROR  =  106.8683 

^&1PLE  MULTIPLE  CORRELATION  COEFFICIENT  =  3.0180 

SAMPLE  COEFFICIENT  OF  DETERMINATION  =  9.1081 

^M  OF  SQUARES  ATTRIBUTABLE  TO  REGRESSION,  SSAR  =  1193133.7500 

^GREES  OF  FREEDOM  ASSOCIATED  WITH  SSAR  =  6.0000 

TOAN  SQUARE  OF  SSAR  =  198855.6250 

SUM  OF  SQUARES  OF  DEVIATIONS  FROM  REGRESSION,  SSDR  =  -1062137.0000 
JISeGREES  of  FREEDOM  ASSOCIATIiD  WITH  SSDR  =  93.0000 

^f!EAN  SQUARE  OF  SSDR  =  -11420.8281 

F-VALUE  =  -17.4117 


I 


iV 


PROGRAM  I TER EG 

Estimates  regression  coefficients  by  an  iterative  method 
DATA  file  NAME:  DFILl 


Wo  OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
IN  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION... 

Tv-  100  7  7  6 

Indices  of  indep  vars  in  regression... 

1  2  3  4  5 

PeGRESSION  method  used  at  each  ITERATION:  WALD'S  METHOD 
wf 

PRINT  AFTER  EVERY  20  ITERATIONS 

^Jata.  .  . 

(FIRST  3  OBSNS) 


V  -18.3276 

-19.4602  - 

■27.2527 

-31.5296 

-37.1658 

-55.0364 

-75.2203 

\\  A.  9921 

-14.9797 

-2.3894 

-18.0335 

-9.7869 

-13.5247 

-3.4312 

’•  16.4380 

-2.3118 

16.7864 

23.9001 

26.1545 

32.6745 

67.9829 

&ANS  FOR  VARIABLES  IN 
ifePENDENT  VARIABLE  LAST 

REGRESSION, 

•  •  * 

-.0472 

v. 

.  3305 

-.4678 

-.6354 

-.3074 

-.2388 

-.4367 

•.|eDI  an  3 .  .  . 

.3833 

-.2219 

.4036 

-2.3108 

.6584 

-1.5403 

-2.3765 

INTERQUARTILE  RANGES... 

14.37  50 

14.4844 

17.6015 

24.9023 

27.1836 

33.7930 

51.1875 

»tEGRESSION 

COEFFICIENTS 

,  INTERCEPT... 

1.1139 

-.9267 

1.1271 

1.1911 

-.4600 

.  3392 

1.1458 

1.0836 

-.9421 

1.0840 

1.1023 

-.4941 

.  4559 

1.0903 

P  1.0596 

-.9564 

1.0590 

1.0516 

-.5782 

.5773 

1.0531 

1.0402 

-.9690 

1.0398 

1.0164 

-.6630 

.6850 

1.0247 

1.0242 

-.9796 

1.0242 

.9890 

-.7376 

.7763 

1.0016 

1.0109 

-.9885 

1.0114 

.9666 

-.8006 

.8529 

.9826 

.9998 

-.9960 

1.0007 

.9480 

-.8535 

.9168 

.9667 

.9905 

-1.0022 

.9918 

.9326 

-.8976 

.9700 

.9536 

^  .9828 

-1.0073 

.9844 

.9198 

-.9343 

1.0144 

.9426 

■4  .97  64 

-1.0116 

.9783 

.9091 

-.9649 

1.0514 

.9335 

PRINT  AFTER 

EVERY  20  ITERATIONS 

^  .9710 

-1.0152 

.9731 

.9002 

-.9904 

1.0822 

.9258 

y  .9666 

-1.0182 

.9688 

.8927 

-1.0117 

1.1079 

.9195 

.9629 

-1.0207 

.  9653 

.  8866 

-1.0294 

1.1293 

.9142 

;%t;sidual  variance  =  1.287  3 

kESIDUAL  STANDARD  ERROR  =  1.1346 

COEFFICIENT  OF  DETERMINATION  =  .9990 


IROGRAM  SIMULA 

[ENERATES  MULTIVARIABLE  UNIVARIATE  SAMPLE 


FILE  NAME:  DFILl 
IlMENSION  (M)  =  6 

SPEFFICI ENT  MATRIX  (C)... 


li 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

m 

0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

1.0000 

1.0000 

0.0000 

•^IGMA  (SCALING 

!  PARAMETER 

:  FOR  INDEP 

'  VARIABLES)  = 

10.0000 

^>0  (INTERCEPT)  = 


1.0000 


"Parameter  vector  (b)  . . . 

1.0000  -1.0000  1.0000  1.0000 


-1.0000 


1.0000 


:G  (MODEL  error  STD  DEV)  = 


1.0000 


OF  033NS  =  100 

;<.<IP  0  RANDOM  NOS  PRIOR  TO  SIMULATION 


rRIANCE  MATRIX  OF  X,  Y... 
100.0000  0.0000  0.0000 
0.0000  100.0000  0,0000 

..  0.0000  0.0000  100.0000 

y,  0.0000  0.0000  0.0000 

0.0000  0.0000  0.0000 

0.0000  0.0000  0.0000 

100.0000 


100.0000  -100.0000 


COEFFICIENT  OF  DETERMINATION  = 
CORRELATION  MATRIX  OF  X,  Y... 


0.0000 

0.0000 

0.0000 

100.0000 

0.0000 

100.0000 

200.0000 

.9986 


0.0000 

0.0000 

0.0000 

0.0000 

100.0000 

100.0000 

0.0000 


0.0000 

0.0000 

0.0000 

100.0000 

100.0000 

200.0000 

200.0000 


100.0000 

-100.0000 

100.0000 

200.0000 

0.0000 

200.0000 

701.0000 


1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

.  3777 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

-.3777 

0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

.  3777 

0.0000 

0.0000 

0.0000 

1.0000 

0.0000 

.7071 

.7554 

0.0000 

0.0000 

0.0000 

0.0000 

1.0000 

.7071 

0.0000 

0.0000 

0.0000 

0.0000 

.7071 

.7071 

1.0000 

.5341 

.3777 

-.3777 

.  3777 

.7554 

0.0000 

.5341 

1.0000 

^OBSERVATIONS.  .  , 

^l^PENDENT  VARIABLE  LAST 
(FIRST  3  0BSN3  PRINTED) 
aBSN  NO  1: 

jl  -18.3278  -15.0855  -13,7183 

feSN  NO  2: 

4.9927  -14.8901  3.3520 


-4.2261  13.3911  9.1650  -25.1126 


3.3520  -20.2808  4.2114  -16.0693  -16.5696 


ydSN  NO 
-1  ^  . 


16.4380  -6.569S 


8 .5474 


11.7395  3.1567  14.9463 


57 . 1609 


StROGRAM  MREG 

gjLTIPLE  LINEAR  REGRESSION  ANALYSIS 


ATA  FILE  NAME:  DFILl 


Sib  OF  OES,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
IN  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION... 
gl  100  7  7  6 

IS'JDICES  OF  INDEP  VARS  IN  REGRESSION... 


J|vTA.  .  . 

(FIRST  3  OBSNS) 

„  -18.3276  -15.0855 

§  4.9927  -14.8901 

*  16.4380  -6.5698 


3.3520  -20.2808 

8.5474  11.7895 


C^rLANS  FOR  ALL  VARIABLES... 


-.0472 


.4075 


-.5128 


-.9081 


^  NfANDARD  DEVIATIONS  FOR  ALL  VARIABLES... 
^  10.2626  9.8716  10.8903  10.3570 

CORRELATION  MATRIX... 


5 

6 

13.3911 

9.1650 

-25.1126 

4.2114 

-16.0693 

-16.5696 

3.1567 

14.9463 

57.1609 

-.1784 

-1.0865 

-1.8081 

9.4954 

14.3035 

27.7991 

r  K 

1.0000 

E 

-.0039 

l.OGOO 

.2032 

-.0813 

1.0000 

1  i 

.0771 

.1454 

-.0221 

1.0000 

1  1 

-.0281 

-.0781 

.0677 

.0364 

1.0000 

V 

.0372 

.0534 

.0289 

.7483 

.6902 

?  ‘i; 

.5096 

-.2773 

.4819 

.7073 

.0629 

• 

EPRil  NANT 

=  .8157E- 

■07 

1.0000 

.5539 


jfiVERSE  OF  THE  PART  OF  THE  CORR  MATRIX  USED  IN  REGRESSION... 
1.0526 

.0029  1.0345 

:  V  -.2188  .0747  1.0562 

.  4.7693  -.  4235  .  665658642455*0 

4.5012  -.1675  .507953763765*049290965*0 

^  -6.7083  .3750  -.87508098794*007425025*5011184811*0 

PRODUCT  OF  MATRIX  AND  INVERSE... 

SHOULD  BE  IDENTITY  MATRIX.  IF  NOT,  ROUTSJDOFF  ERRORS 


1.0000 


.^AVE  RUINED  SOLUTION. 

“  1.0000 

0.0000  l.UOOO 

■'1;  -.0000  -.000* 

X  0.0000  .0000 

0.0000  -.0000 

.0000  -.000  ■ 


DISREGARD  REST  OF  ANALYSIS. 


1.0000 

-.000* 

0.0000 

0.0000 


1 . 0000 
.500  0 
.5000 


1.0000 

.  5000 


0.0000 


.1  I'i 

OE  Tii 

E  PARI'  f'L'  111 

E  CROSS  PRt'D 

MATRIX  USED  IN  RECRESSiO 

. 101 01 -03 

. 2901 E-0 6 

.107  2i;-0  3 

-.1978E-04 

.7U16F-05 

.899  6I';-04 

.4532E-03 

-.4184E-U4 

. 5961E-04 

.5  522E+0  3 

m 

. 4666E-03 

-.180 5 E-0 4 

.4961E-04 

.5522E+03  .5522E+03 

i 


-.4616E-03 


.2683E-04 


,5674E-04 


INDEX  OF  DEP  VAR  IN  REGULSSION  = 

f DICES  OF  IN'DEP  VARS  IN  REGRESSION... 

12  3  4 

REGRESSION  COEFFICIENTS. . . 

H  1.0043  -.9906  1.0079  1.3420 

R- VALUES. . . 

11.5150  -11.0202  12.2417  .0066 

f|.’D  DEVS  OF  REG  COEFS .  .  . 


-.5522E+03  -.5522E+03 

7 

5  6 

-.7319  .9718 

-.0036  .0048 

203.9858  203.9859 


.5522E+T3 


.0872  .0899  .0823  203.9857 

INTERCEPT  =  1.30  38 

RESIDUAL  STANDARD  ERROR  =  8.6805 

BLmPLE  multiple  CORRELATION  COEFFICIENT  =  1.0448 

Sample  coefficient  of  determination  =  1.0916 

SUM  OF  SQUARES  ATTRIBUTABLE  TO  REGRESSION,  SSAR  = 
fcJ^GREES  OF  FREEDOM  ASSOCIATED  WITH  SSAR  =  6.0000 

t^AN  SQUARE  OF  SSAR  =  13918.9697 

SUM  OF  SQUARES  OF  DEVIATIONS  FROM  REGRESSION,  SSDR  = 
>^GREES  OF  FREEDOM  ASSOCIATED  WITH  SSDR  =  93.0000 

jfeAN  SQUARE  OF  SSDR  =  -75.3516 

F-VALUE  =  -184.7202 


83513.8203 

-7007.7031 


I 


•j 

•j 


i 


^OGRAM  ITEREG 

3TIMATES  REGRESSION  COEFFICIENTS  BY  AN  ITERATIVE  METHOD 


KTA  file  NAME:  DFILl 

OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
IN  REGRESSION,  NO  0?  INDEP  VARS  IN  REGRESSION... 
ffi  100  7  7  6 

BSIDICES  OF  INDEP  VARS  IN  REGRESSION... 

1  2  3  4  5 


GRESSION  METHOD  USED  AT  EACH  ITERATION:  WALD'S  METHOD 


PJ^INT  AFTER  EVERY  1  ITERATIONS 

^TA.  .  . 

(FIRST  3  OBSNS) 


-18.3276 

-15.0855 

-13.7183 

-4.2261 

13.3911 

9.1650 

-25.1126 

>■  4.9927 

-14.8901 

3.3520 

-20.2808 

4.2114 

-16.0693 

-16.5696 

16.4380 

r. 

-6.5698 

8.5474 

11.7895 

3.1567 

14.9463 

57.1609 

jl^ANS  FOR  VARIABLES  IN  REGRESSION, 
DEPENDENT  VARIABLE  LAST... 


UXliriLlMUHilN  i  Lilli  Li/\O  i 

-.0472  .4075 

... 

-.5128 

-.9081 

-.1784 

-1.0865 

-1.8081 

:<i:Di  ANS  . . , 

.3833 

.4614 

-1.1401 

-2.4292 

-.7105 

-2.5537 

-3.3587 

^TERQQARTILE  RANGES.  .  . 

14.3750 

13.9843 

15.3906 

13.3203 

11.8750 

16.3281 

37.7148 

JjEGRESSION 

COEFFICIENTS 

,  INTERC 

EPT.  .  . 

"  1.1909 

-.5521 

.9881 

1.8828 

.0843 

.0026 

.7074 

^  .9934 

-.9575 

1.0006 

1.9690 

-.0044 

.  0051 

.9350 

i!  .9997 

-.9880 

1.0031 

1.9819 

-.0136 

.  0034 

.9571 

*-•  .9999 

-.9919 

1.0036 

1.9856 

-.0127 

.0010 

.  9599 

.9999 

-.9923 

1.0037 

1.9883 

-.0104 

-.0014 

.9603 

rc  .9999 

-.9924 

1.0037 

1.9907 

-.0080 

-.0039 

.9603 

.9999 

-.9924 

1.0037 

1.9932 

-.0056 

-.0064 

.  9603 

.9999 

-.9924 

1.0037 

1.9957 

-.0031 

-.0088 

.9603 

«  .9999 

-.9924 

1.0037 

1,9981 

-.0006 

-.0113 

.9603 

{{  .9999 

i  * 

-.9924 

1.0037 

2.0006 

.0018 

-.0137 

.9603 

RESIDUAL  VA 

RIANCE  = 

1.0021 

^SIDUAL  ST 

ANDARD  ERROR 

= 

1.0011 

JSDEFFICIENT 

OF  DETERMINATION  = 

.9987 

^OGRAM  SIMULA 

^NERATES  MULTIVARIABLE  U’NIVARIATE  SAMPLE 

{LE  NAME:  DFILl 

MENSION  (M)  =  10 

rEFFICI ENT  MATRIX  (C)... 

COEFPT  MATRIX  IS  IDENTITY  MATRIX 

^GMA  (SCALING  PARAxMETER  FOR  INDEP  VARIABLES)  = 

^>3  (INTERCEPT)  =  1.0000 


10.0000 


1 

lRAMETER  vector  (B) 
1.0000  1.0000 

-1.0000 

1.0000 

1.0000 

-1.0000 

1.0000 

1.0000 

-1.0000  1.0000 

V 

>IG  (MODEL  ERROR  STD 

NIP  OF  OBSNS  =  100 
^IP  0  RANDOM  NOS  : 

VARIANCE  MATRIX  OF  X 
;•>  100.0000  0.0000 

DEV)  -  1.0000 

PRIOR  TO  SIMULATION 

,  Y... 

0.0000  0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

< 

0.0000 

0.0000 

0.0000 

100.0000 

100.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

1 

0.0000 

O.OOQO 

0.0000 

0.0000 

100.0000 

100.0000 

0,0000 

0.0000 

0.0000 

0.0000 

o.ooco 

*•* 

0.0000 

0.0000 

0.0000 

0.0000 

-100.0000 

0.0000 

100.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

100.0000 

0.0000 

0.0000 

100.0000 

0.0000 

0.0000 

0.0000 

■ 

0.0000 

0.0000 

0.0000 

0.0000 

100.0000 

0.0000 

0.0000 

0.0000 

100.0000 

0.0000 

o.ocoo 

< 

0.0000 

0 . 0000 

0.0000 

0.0000 

-100.0000 

0.0000 

0.0000 

0.0000 

0.0000 

100.0000 

0 .0000 

0.0000 

0.0000 

0.0000 

0.0000 

100.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

100.0000 

0.0000 

0.0000 

0.0000 

0.0000 

100.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

k  ^ 

k 

100.0000 

0.0000 

0.0000 

0.0000 

-100.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

N' 

0.0000 

100.0000 

100.0000 

100.0000 

100.0000 

-100.0000 

100.0000 

100.0000 

-100.0000 

100.0000 

100.0000 

22-100.0000 

100.0000 

1001.0000 

COEFFICIENT  OF  DETERMINATION  = 
'correlation  MATRIX  OF  X,  Y.  .  . 


1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

.3161 

0 . 0000 

1.0000 

0.0000 

0.0000 

0.0000 

.  3161 

0,0000 

0.0000 

1.0000 

0.0000 

0.0000 

-.3161 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

.  3161 

.9990 

0.0000 

0.0000 

0.0000 

1.0000 


0 . 0000 
0.0000 
0.0000 
0.0000 


0.0000 
0.0000 
0.0000 
0 . 0000 


0.0000 
0 . 0000 
0 .0000 
0 . 0000 


0 . 0  0  e  0 
0.0000 
O.OOQO 
0.00 CO 


0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

.3161 

-.3161 


0.0000 
0.0000 
0.0000 
0.0000 
0.0  00 
0 .0000 
0.0000 
0.0000 
0.0000 
0.0000 
0.0000 
1.0000 
.3161 
.3161 


0.0000 
.3161 
0.0000 
-.3161 
0.0000 
.3161 
0.0000 
.  3161 
0.0000 
-.3161 
0.0000 
.3161 
-.3161 
1.0000 


0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

.3161 


1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

.3161 


0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

-.3161 


0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

.3161 


OBSERVATIONS. . . 

I^^SPENDENT  VARIABLE  LAST 
'4^IRST  3  OBSNS  PRINTED) 
iOBSN  NO  1 : 

>>-18.3276  -15.0855  -13 

-14.8901  3.3520  2  = 

&ESN  NO  2: 

^  4.2114  6.8286  -2 

2  2.6489  10.2661  3£ 

ifeSN  NO  3 : 

-10.1245  -8.1323  -8 

13,3130  20.30  52  3C 


0.0000 
0.0000 
0.0000 
1.0000 
0.0000 
0 . 0000 
.3161 


15.0855 

3.3520 

-13.7183 

25.5442 

-4.2261 

13.3911 

-20.8667 

-6-9995 

4.9927 

6.8286 

10.2661 

-2.4292 

38.9543 

16.4380 

-6.5698 

8.5474 

11.7895 

3.1567 

-8.1323 

20.3052 

-8.0151 

30.1770 

10.2270 

6.5942 

-8.9136 

3.7036 

4.4458 

.'Civf-.*' 


•  % 
.•-l'vV 


bus 


O  V*  % 
W.V."- 


OGRAiM  MREG 

ILTIPLE  LINEAR  REGRESSION  ANALYSIS 


FILE  NAME;  DFILl 

1 

i)  OF  OBS, 

NO  OF  VARS 

,  INDEX  OF 

DEP  VAR 

IN  REGRESSION,  NO  OF 

INDEP  VARS 

IN  REGRESSION... 

n 

100 

11 

11 

10 

NDICES  OF 

INDEP  VARS 

IN  REGRESSION... 

1 

2 

3 

4 

5 

6 

7 

8 

5  ^ 

fa 

i5^ta.  . . 

10 

(FIRST  3 

OBSNS) 

-18.3276 

-15.0855 

-13.7183 

-4.2261 

13.3911  - 

20.8667 

-6.9995 

4.9927 

4 

-14.8901 

3.3520 

25.5442 

4.2114 

6.8286 

-2.4292 

16.4380 

-6.5698 

8.5474 

11.7895 

3.1567 

L> 

2.6489 

10.2661 

38.9543 

-10.1245 

-8.1323 

-8.0151 

10.2270 

6.5942 

-8.9136 

3.7036 

4.4458 

13.3130 

20 . 3052 

30.1770 

J-EANS  FOR  ALL  VARIABLES... 

m 

'  .1809 

-.0144 

-.8847 

-.0300 

-.4503 

-.2456 

.0841 

-.6612 

-.2816 

.  4231 

1.9894 

^TA^3DARD  DEVIATIONS  FOR  ALL  VAR: 

EA3LES. . . 

11.2156 

10.1579 

9.8358 

10.7963 

9.8302 

9.7790 

10.1817 

8.9774 

i 

8.7924 

1 

10.4552 

32.0968 

a 

DRRELATION 

MATRIX. . . 

1.0000 

c:: 

.0715 

1.0000 

c*. 

.0254 

.0648 

1.0000 

-.0757 

.2606 

.1868 

1.0000 

'  -.0717 

.1217 

.  2235 

.1323 

1.0000 

*> 

.0586 

.0191 

-.1219 

-.0018 

-.0893 

1.0000 

.  0552 

.0508 

.1240 

.1180 

-.0356 

.1012 

1.0000 

'V 

— .  062o 

-.0388 

-.0055 

.0173 

.1513 

-.0569 

.0051 

1 . 0  (J  U  0 

A 

«  •- 

-.0012 

1.0000 

.  1444 

.0992 

.0022 

-.1068 

-.0386 

.1414 

.  -.1325 

-.0664 

-.0577 

.0919 

.1316 

.1494 

-.1056 

-.0247 

.  0  0  8  4 

-.0353 

1.0000 

:v 

.  2760 

.  3818 

-.0691 

.4557 

.4252 

-.2753 

.2650 

.  382  4 

-.2871 

.  3955 

1.0000 

l^'TERMINANT 

=  .63S7E+00 

S 

i VERSE  OF 

THE  PART  O: 

THE  CORR 

MATRIX  USED 

IN  REGRES 

SION,  •  • 

8 

1.0337 

-.1058 

1.1368 

-.0619 

.0245 

1.1325 

.1037 

-.3037 

-.1702 

1.1582 

c 

.0725 

-.1531 

-.2427 

-.0484 

1.1465 

-.0457 

-.0205 

.  1248 

-.0244 

.0498 

1.0495 

A 

V 

-.0552 

.0048 

-.1244 

-.1179 

.  0534 

-.1206 

1 . 0  (w  3 

A 

.0485 

.0  342 

.0384 

-.0012 

-.1457 

.05  27 

- .  L  :■  3 1 

1.0  ; 

fi 

.0438 

1.0906 

-.1755 

-.1156 

.0636 

.1394 

.  0635 

-.1425 

/.IV 1  ; 

! 

I 


!><' 


.0352 

.0076 


.1086 

1.0630 


-.0447 


-.1430 


-.1252 


.0866 


.0271 


-.0393 


^ODUCT  OF  MATRIX  AND  INVERSE.  . 
lOULD  BE  IDENTITY  MATRIX.  IF 


NOT,  ROUNDOFF  ERRORS 


HAVE  RUINED  SOLUTION. 


DISREGARD  REST  OF  ANALYSIS, 


1 

1.0000 

i 

.0000 

1.0000 

I 

.0000 

-.000'* 

1.0000 

•j 

-.0000 

.000* 

-.000*  1.0000 

5 

a 

-.0000 

-.000* 

-.000*  -.000* 

1.0000 

j 

.0000 

-.000* 

-.000*  .000* 

-.000* 

1.0000 

.0000 

-.000* 

-.000*  -.000* 

.000* 

-.000*  1.0000 

■ 

.0000 

.000* 

-.000*  .000* 

.000* 

-.000*  .000* 

1.0000  ! 

-.0000 

.000* 

.000*  -.000* 

-.000* 

-.000*  .000* 

-.000*  y 

1.0000 

0.0000 

-.0000 

-.000*  0.0000 

-. 0000 

0.0000  -.0000 

0.0000 

t-: 

-.0000 

1.0000 

^^.NVERSE  OF 

V  • 

'  .  O  A  1  TT 

THE  PART  OF 

A 

THE  CROSS  PROD  MATRIX 

USED  IN 

REGRESSION. . . 

E? 

I 


•.9382E-05 
■.5672E-05 
.9068E-05 
.6643E-05 
•.4213E-05 
■.4879E-05 
. 4861E-05 
.4489E-05 
.1425E-03 
. 3029E-05 
.8335E-06 


1113E-03 

2478E-05 

2797E-04 

1549E-04 

2089E-05 

4724E-06 

3783E-05 

,1984E-04 

1033E-04 

,9823E-04 


1182E-03 

1619E-04 

2536E-04 

1311E-04 

1255E-04 

4391E-05 

1350E-04 


1004E-03 

4608E-05 

2332E-05 

1083E-04 

1294E-06 

6772E-05 


.1198E-03 
.5233E-05 
. 5385E-05 
-.1668E-04 
.1629E-04 


V.v' 


'••NDEX  OF  DEP  VAR  IN  REGRESSION  = 
•xNDICES  OF  INDEP  VARS  IN  REGRESSION... 

12  3 

V  9  10 

•'egression  COEFFICIENTS.  .  . 


-.4338E-05  -.1279E-04  -.1231E-04 

11 


1109E-03 

1223E-04 

6063E-05 

7464E-05 

,8555E-05 


1040E-U'' 
.4758E-;K^ 
,1612E-/^51'_ 

.2.57  3 E-i 


.9941  .9888 

9915  .9903 

.9758 

-.9970 

.  9946 

l.C2  2i  F-Jv 

-1.0133  .9899 

‘i-VALUES ... 

99.2642  85.2707 

-82. 

9539  89.9310 

81.0920 

-86.1448 

88.7290 

81.113;  ^ 

tm  -n  .2245  90.86  34 

H 

•>7D  DEVS  OF  i-i:C  CONES 

•  • 

.0100  .0116 

• 

0120  .0110 

.0120 

.0116 

.0112 

.0126  A--;.' 

.0131  .0109 

•>:iTERCEPT  1.05  89 

JyeSIDUAL  STAN'OARD  ERROR  = 

1.0992 

SAMPLE  MULTURE  CORRE 

eatio 

CO  N F F I  Ci  E;  .T  -- 

.9995 

“SAMPLE  COEFF I  Cl  ENl'  Or 

DET  !- 

RMTN'.Tlf - 

.  9R89 

OF  SQUARES  aTTRIC 

IJi'ALN 

E  'UO  RiiiGRxSSION, 

SOAR 

1 0  I  8  o  2  .  1  8 

] 

degrees  OF  FREEDOM  AS 

'•'INI  / 

I’Ei:;  VM  'IS'  SSAR  - 

in  .0000 

“i^AM  SQUARE  OF  5SAR  ^  1.0180.2793 

OF  SQUARES  OF  DEVIATIONS  FROM  RECNOISSLON,  SSDR  ^ 
^IGREEU  Of'  FREEDOM  ASSOC  L  .FE.  S'  V,  ON!  S.A.R  o9.0000 

:^EAN  SQUARE  OF  SSDR  1.2082 

•>- VALUE  -  8432.4961 


I 


107.531  ?. 


I 


»*C^OGRAM  I  TER  EG 

;-*,STIHATES  REGRESSION  COEFFICIENTS  BY  AN  ITERATIVE  METHOD 


I 

I 


TA  FILE  NAME:  DFILl 


NO  OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 

REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION.. 
100  11  11  10 
IDICES  OF  INDEP  VARS  IN  REGRESSION... 


1 

2 

3  4 

5 

6 

7 

8 

10 

REGRESSION 

METHOD  USED 

AT  EACH  ITERATION: 

WALD'S  METHOD 

-/RINT  AFTER 

N  * 

EVERY  1  ITERATIONS 

Q.ATA.  .  . 

VFIRST  3 

OBS NS ) 

•'*  -18.327  6 

-15.0855  - 

13.7183  -4.2261 

13.3911 

-20.8667 

-6.9995 

4.5927 

-14.8S01 

3.3520 

25 . 5442 

4.2114 

6.8286 

-2.4292  16.4380 

-6.5698 

8.5474 

11.7895 

3.1567 

d  2.6489 

10.2661 

38.9543 

-10.1245 

-8.1323 

-8.0151  10.2270 

6.5942 

-8.9136 

3.7036 

4.4458 

13.3130 

20.3052 

30.1770 

.VeANS  FOR  VARIABLES  IN 

REGRESSION, 

DEPENDENT  VARIABLE  LAST 

•  •  • 

||  .1809 

-.0144 

-.8847  -.0300 

-.4503 

-.2456 

.0841 

-.6612 

■  -.2816 

.  4231 

1.9894 

fi^EDIANS.  .  . 

.1880 

.8130 

-.7495  .5786 

-.8276 

-.2026 

-1.4136 

-1.2183 

-.0464 

.1489 

.  3645 

WTERQUARTILE  RANGES... 

15.4688 

14.0235 

12.7734  16.7969 

13.5547 

10.8985 

15.6250 

10.5766 

12.5390 

14.8828 

38.5937 

rO-GKESSlON 

COEFFICIENTS 

,  INTERCEPT... 

.7239 

1.1009 

-.1421  1.1834 

1.3133 

-1 . 3554 

.  8399 

1 . 2278 

^  -.9845 

.  6267 

2.2392 

1.0218 

.7820 

-1.0716  1.0741 

1.0202 

-1.0831 

1.0582 

1.0402 

-.9638 

.  9634 

1.0131 

1.0310 

.9516 

-1.0316  .9993 

.9905 

-.9890 

1.0075 

1.0262 

-1.00  61 

.9872 

1.0298 

^  1.0052 

.  9900 

-.9849  .9904 

.9751 

-.9918 

1.0021 

1.0175 

,  .  -1.02  34 

.  98  35 

1.0598 

.9931 

.  99  3  4 

-.9748  .9903 

.9713 

-.9948 

1,00  39 

1.0147 

-1.026  9 

.  9825 

1 .0652 

.  997] 

.  99  3  4 

-.9736  .9903 

.9709 

-■ .  9  9  5  3 

1.0046 

1.0141 

-1.027  3 

.  98  21 

1.0656 

C  .9970 

.  99  34 

-.9735  .9902 

.9709 

-.9953 

1,0047 

1.0141 

-1.0273 

.  98  21 

:l  .  0  65  6 

M  -9969 

.  99  3'’- 

-.9735  .9901 

.  97  08 

-.99  5  ;< 

1.0047 

1.0141 

-1.0274 

.  98  2  4 

1.0655 

.  9969 

.  993  4 

-.9735  .9901 

.9708 

-.9953 

1.0047 

1.0140 

-1.0274 

.  98  2  4 

1.0656 

i[  -  9969 

.  9935 

-.9735  .9901 

.9708 

-.  9953 

1.00  47 

1.0140 

•V*'.  *'• 

JROGRAM  SIMULA 

ENERATES  MULTIVARIABLE  UNIVARIATE  SAMPLE 


FILE  NAME:  DEI LI 
ISiMENSION  (M)  =  10 


I  EFFICIENT  MATRIX  (C). 


a 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

? 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

y.. 

.1000 

.1000 

1.0000 

.1000 

.1000 

.1000 

.1000 

.1000 

.1000 

.1000 

0.0000 

0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

.2000 

.2000 

.2000 

.2000 

1.0000 

.2000 

.2000 

.2000 

‘v 

.2000 

.2000 

V 

%  j 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

i 

.3000 

.  3000 

.  3000 

.3000 

.3000 

.3000 

1.0000 

.  3000 

.3000 

.3000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

.4000 

.  4000 

.4000 

.4000 

.4000 

.4000 

.4000 

.4000 

V 

1.0000 

.4000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

i 

0.0000 

1.0000 

SIGMA  (SCALING  PARAMETER  FOR  INDEP  VARIABLES)  = 

10.0000 

•** 

(INTERCEPT)  = 

1.0000 

para:-ieter  vector  (b). 

•  • 

9 

1.0000 

1.0000 

-1.0000 

1.0000 

1.0000 

-1.0000 

1.0000 

1.0000 

> 

-1.0000 

1.0000 

•>IG  (MODEL 

•  ^ 

ERROR  STD 

DEV)  = 

1.0000 

NO 

OF  OBSNS 

=  100 

S^IP  0  RANDOM  NOS  PRIOR  TO  SIMULATION 

r: 

VARIANCE  MATRIX  OF  X, 

y. . . 

100.0000 

0.0000 

10.0000 

0.0000 

20.0000 

0.0000 

30.0000 

O.OCOO 

»**T. 

40.0000 

0.0000 

100.0000 

a 

0.0000 

100.0000 

10.0000 

0.0000 

20.0000 

0.0000 

30.0000 

0.0000 

40.0000 

0.0000 

100.0000 

10.0000 

10.0000 

109.0000 

10.0000 

46.0000 

10.0000 

64.0000 

10.0000 

82.0000 

10.0000 

-41.0000 

0.0000 

0 . 0000 

10.0000 

100 . 0000 

20.0000 

0.0000 

30.0000 

0.0000 

40.0000 

0 . 0000 

100.0000 

r 

20.0000 

20 . 0000 

46.0000 

20.0000 

136.0000 

20.0000 

90.0000 

20.0000 

L. 

124.0000 

20.0000 

144.0000 

0.0000 

0.0000 

10.0000 

0 . 0  000 

20.0000 

100.0000 

30.0000 

0.0000 

b  "• 

40.0000 

0.0000 

-100.0000 

30.0000 

30.0000 

64.0000 

30 . 0000 

98.0000 

30.0000 

181.0000 

30.0000 

166.0000 

30.0000 

169.0000 

i 

O.OCOO 

0.0000 

10.0000 

0.0000 

20.0000 

0.0000 

30.0000 

100.0000 

I 


I 


40.0000 

40.0000 

244.0000 

0.0000 

40.0000 

100.0000 

124.0000 


0.0000 

40.0000 

40.0000 

0.0000 

100.0000 

100.0000 

100.0000 


100.0000 

82.0000 

124.0000 

10.0000 

100.0000 

-41.0000 

831.0001 


40.0000  124.0000  40.0000  166.0000  40.0000 


0.0000  20.0000 


0.0000  30.0000 


0.0000 


COEFFICIENT  OP  DETERMINATION  = 


100.0000  144.0000  -100.0000  169.0000  100.0000 


.9988 


iRRELATION 

MATRIX  OF 

X,  Y. . . 

1.0000 

0.0000 

.0958 

0.0000 

■ 

.2561 

0.0000 

.3469 

0.0000 

1.0000 

.0958 

0.0000 

.2561 

0.0000 

.3469 

.0958 

.0958 

1.0000 

.0958 

K 

v 

.5028 

.0958 

-.1362 

0.0000 

0.0000 

.0958 

1.0000 

.2561 

0.0000 

.  3469 

.1715 

.1715 

.3778 

.1715 

>: 

.  6807 

.1715 

.4283 

0.0000 

0.0000 

.0958 

0.0000 

.  2561 

0.0000 

-.3469 

.2230 

.2230 

.  4556 

.2230 

.7899 

.2230 

.4358 

0.0000 

0.0000 

.0958 

0.0000 

.  2561 

0.0000 

.3469 

.2561 

.2561 

.5028 

.2561 

1.0000 

.2561 

.2754 

0.0000 

0.0000 

.0958 

0.0000 

1 

.2561 

1.0000 

.3469 

.  3469 

.  3469 

-.1362 

.3469 

5' 

.2754 

.  3469 

1.0000 

OBSERVATIONS. 

•  • 

ni 

;PENDENT  VARIABLE  LAST 

i 

'IRST  3 

OBSNS  PRINTED) 

&BSN  NO 

1: 

-18.3276 

- 

15.0855 

-19.5842 

-4.2261 

-37 .8353 

3.3520 

17.6379 

JSN  NO 

2: 

4.2114 

6.8286 

3.3025 

16.4380 

a. 

23.5444 

10.2661 

37.5481 

-feSN  NO 

3: 

-10.1245 

-8.1323 

-4.8733 

10.2270 

17.3491 

20.3052 

32.2707 

.1715 

0.0000 

.2230 

0.0000 

.1715 

0.0000 

.2230 

0.0000 

.3778 

.0958 

.4556 

.0958 

.1715 

0.0000 

.2230 

0.0000 

1.0000 

.1715 

.6246 

.1715 

.1715 

I.OUOO 

.2230 

0.0000 

.6246 

.2230 

1.0000 

.  2230 

.1715 

0.0000 

.2230 

1.0000 

.6807 

.2561 

.7899 

.  2561 

.1715 

0.0000 

.  2230 

0.0000 

.  4283 

-.3469 

.4358 

.3469 

-3.7627 

-20.8667 

-26 . 6131 

4.9927 

5.7217 

8.5474 

24.7190 

3.1557 

9.9560 

-8.9136 

9.6135 

4.4458 

PROGRAM  MREG 

»LTIPLE  LINEAR  REGRESSION  ANALYSIS 
DATA  FILE  NAME:  DFILl 

I  OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 


IN  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION... 


m  100 

11 

11 

10 

S^DICES  OF 

INDEP  VARS 

IN  REGRESSION... 

1 

2 

3 

4 

5 

6 

7 

8 

10 

MkTA.  .  . 

(FIRST  3 

OBSNS) 

-18.3276 

-15.0855 

-19.5842 

-4.2261 

-3.7627 

-20 .8667 

-26.6131 

4.9927 

M  -37 .8853 

3.3520 

17.6379 

4.2114 

6.8286 

3.3025 

16.4380 

5.7217 

8.5474 

24.7190 

3.1567 

23.5444 

10.2661 

37.5481 

-10.1245 

-8.1323 

-4.8733 

10.2270 

9.9560 

-8.9136 

9.6135 

4.4458 

'  17.3491 

20 . 3052 

32.2707 

^ANS  FOR  ALL  VARIABLES... 

■  .1809 

-.0144 

-.9842 

-.0300 

-.7362 

-.2456 

-.5050 

-.6612 

-.9207 

.4231 

1.8532 

Standard  deviations  for  all  variables... 

11.2156 

10.1579 

11.0153 

10.7963 

12.6680 

9.7790 

15.3408 

8.9774 

^  16.7076 

10.4552 

29.6484 

CORRELATION 

MATRIX.  .  . 

1.0000 

ft  .0715 

1.0000 

^  .1161 

.2030 

1.0000 

-.0757 

.2606 

.3178 

1.0000 

■  .1219 

.  3381 

.5962 

.3737 

1.0000 

«  .0586 

.0191 

-.0289 

-.0018 

.0647 

1.0000 

.2317 

.  3488 

.6098 

.4160 

.6869 

.1958 

1.0000 

I*.  -.062  6 

-.0388 

.0732 

.0173 

.2289 

-.0569 

.1695 

1.0000 

ft  .2519 

.  4437 

.6629 

.4428 

.7419 

.1700 

.8684 

.1628 

1.0000 

.  -.0664 

-.0577 

.1822 

.1316 

.2812 

-.1056 

.2220 

.0684 

5  .2746 

1.0000 

.2985 

.4192 

.1470 

.4788 

.5795 

-. 3112 

.5123 

.  3875 

.4474 

.4197 

1.0000 

Seterminant 

=  .2064E- 

01 

inverse  of 

THE  PART  OF 

THE  CORK 

MATRIX  USED 

IN  REGRESSION. . . 

jk  1.1878 

.1190 

1.4176 

-14  37 

.  3235 

2.0519 

H  .2815 

-.0652 

.0275 

1.3480 

■  .0597 

-.1781 

-.  4746 

-.0985 

2.5064 

.0861 

.1732 

.  3451 

.1239 

.0269 

1.1623 

-.1331 

.1639 

-. 2461 

-.2198 

-.3721 

-.2769 

4.3004 

‘•3  .1608 

.  2041 

.2125 

.1265 

-.2961 

.1486 

-.1257 

1.1297 

-.5956 

-1.0093 

-1.1262 

-.4446 

-1.0076 

-.  4453 

-3.1919 

-.2302 

B  6.3430 

■  .1970 

-.6155 


.3345 

1.2307 


.1700 


.0356 


-.2294 


.2253 


.0804 


g^ODUCT  OF  MATRIX  AND  INVERSE. . . 

SHOULD  BE  IDENTITY  MATRIX.  IF  NOT,  ROUNDOFF  ERRORS 


^VE  RUINED  SOLUTION, 


DISREGARD  REST  OF  ANALYSIS. 


1.0000 

-.0000 

.0000 

.0000 

.0000 

.0000 

1.0000 

.000* 

.000* 

.000* 

.000* 

1.0000 

.000* 

.000* 

.000* 

1.0000 

.000* 

.000* 

1.0000 

.000* 

1.0000 

>'  ' 

.0000 

.000* 

-.000* 

.000* 

.000* 

.000* 

1.0000 

-.0000 

-.000* 

.000* 

-.000* 

.000* 

-.000* 

.000* 

1.0000 

.0000 

1.0000 

.000* 

.000* 

.000* 

.000* 

.000* 

.000* 

.000* 

.0000 

-.0000 

.000* 

1.0000 

.000* 

.000* 

-.000* 

.000* 

.000* 

.000* 

m 

^Inverse  of  the  part  of  the  cross  prod  matrix  used  in  regression.  . . 

'fjX  p  1— I  A 


.9538E-04 

.1055E-04 

.1175E-04 

.2348E-04 

.4242E-05 

.7931E-05 

-.7813E-05 

.1613E-04 

-.3210E-04 

.2295E-03 

.1697E-04 

-.3559E-04 


.1388E-03 

.2920E-04 

-.6005E-05 

-.1398E-04 

.1761E-04 

.1062E-04 

.2261E-04 

-.6007E-04 

.3182E-04 

.1137E-03 


.1708E-03 

.2334E-05 

-.3436E-04 

.3236E-04 

-.1471E-04 

.2170E-04 

-.6181E-04 


.1168E-03 

-.7272E-05 

.1185E-04 

-.1341E-04 

.1318E-04 

-.2490E-04 


1578E-03 

,2191E-05 

,1934E-04 

,2629E-04 

,4809E-04 


.1228E-03 

■.1865E-04 

.1710E-04 

■.2753E-04 


.1491E-04  .3183E-05  -.1749E-04  .2226E-04 


,  1846E-'S; 
,9220E-\^- 
,1258E- , 

.5065E- 


;^NDEX  of  DE 

indices  of 

H  9 

Degression 
1.0007 
-1.0111 
Lvalues  . . . 

93.2158 
^  -60.7164 
kTD  DEVS  OF 
.0107 
.0167 

^TERCEPT  = 

Residual  st 


P  VAR  in  regression  = 
INDEP  VARS  IN  regression. 
2  3 

10 

COEFFICIENTS... 


9954 

9965 


-.9832 


93.2158  76.8713  -68.4353  83.9122  70.8374  ■ 

^  -60.7164  85.0077 

kTD  DEVS  OF  REG  COEFS... 

.0107  .0129  .0144  .0119  .0138 

.0167  .0117 

5;;fJTERCEPT  =  1.0589 

Residual  standard  error  =  1.0992 

SAMPLE  multiple  CORRELATION  COEFFICIENT  =  .9994 

J'AMPLK  COEFFICIENT  OF  DETERMINATION  =  .9988 

•Cum  OF  SQUARES  ATTRIBUTABLE  TO  REGRESSION,  SSAR  = 
DEGREES  OF  FREEDOM  ASSOCIATED  WITH  SSAR  =  10.0000 

jt^.AM  SQUARE  OF  SSAR  =  8691.6211 

PUM  OF  SQUARES  OF  DEVIATIONS  FROM  REGRESSION,  SSDR  = 
DEGREES  OF  FREEDOM  ASSOCIATED  WITH  SSDR  =  89.0000 

^.AN  SQUARE  OF  SSDR  =  1.2083 

ffl-VALUE  =  7193.2402 


.9969 


,9780 


-.9903 


83.9122  70.8374  -81.3103 


0122 


1.0018 


67.0802 


0149 


,0238 


6530 


,0131 


m 


86916.2109 


107.5391 


m 


te' 


fOGRAM  ITEREG 

TIMATES  REGRESSION  COEFFICIENTS  BY  AN  ITERATIVE  METHOD 
DATA  FILE  NAME:  DFILl 

1  OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
IN  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION . . . 

m  100  11  11  10 

INDICES  OF  INDEP  VARS  IN  REGRESSION... 

1  2  3  4  5 

S9  10 

^EGRESSION  METHOD  USED  AT  EACH  ITERATION:  WALD'S  METHOD 


INT  AFTER  EVERY  4  ITERATIONS 


DATA. . . 

•vyiRST  3  OBSNS) 

-18.3276  -15.0855  -19.5842 

-37.8853  3.3520  17.6379 


4.2114 
D  23.5444 
■  -10.1245 
17.3491 


6.8286 

10.2661 

-8.1323 

20.3052 


3.3025 
37 .5481 
-4.8733 
32.2707 


-4.2261 

16.4380 

10.2270 


-3.7627  -20.8667 

5.7217  8.5474 


9.9560 


-8.9136 


AtlANS  FOR  VARIABLES  IN  REGRESSION, 
DEPENDENT  VARIABLE  LAST... 


RESIDUAL  VARIANCE  = 
*:SIDUAL  STANDARD  ERROR  = 


1.3789 


1.1743 


-26.6131  4.9927 

24.7190  3.1567 

9.6135  4.4458 


^  .1809 

-.0144 

-.9842 

-.0300 

-.7362 

-.2456 

-.5050 

-.6612 

1  -.9207 

.4231 

1.8532 

WjEDIANS.  .  . 
.1880 

.8130 

-.8577 

.5786 

-.0596 

-.2026 

.3088 

-1.2183 

*■  -.1978 

.1489 

-.7137 

PiTERQUARTILE  RANGES.. 

• 

V  15.4688 

14.0235 

15.0469 

16.7969 

17.5469 

10.8985 

23.4218 

10.9766 

24.0156 

14.8828 

36.7695 

EGRESSION 

COEFFICIENTS,  INTERCEPT. . . 

.9768 

.7948 

-1.0648 

1.0780 

.9434 

-1.0708 

.6114 

.9883 

n  -.5644 

.8971 

1.1667 

p  .9779 

.9134 

-1.0462 

1.0049 

.9431 

-1  .0295 

.8290 

1.0049 

e.  _  7777 

.9509 

1.0958 

.9858 

.9527 

-1.0003 

1.0032 

.9334 

-1.0069 

.9064 

1.0132 

-.8737 

.9705 

1.0865 

«  .9919 

.9696 

-.9829 

1.0030 

.9399 

-.9985 

.9424 

1.0169 

-.9231 

.9779 

1.0816 

3*;  .9951 

.9782 

-.9755 

1.0022 

.9450 

-.9946 

.9614 

1.0188 

g  -.9489 

.9815 

1.0786 

^INT  AFTER 

EVERY  4 

ITERATIONS 

■  .9967 

.9827 

-.9717 

1.0016 

.9476 

-.9926 

.9713 

1.0198 

■  -.9624 

.9835 

1.0770 

.9975 

.9850 

-.9698 

1.0014 

.9489 

-.9915 

.9765 

1.0203 

SI  -.9693 

.9845 

1.0762 

^ 


Hefficient  of  determination  = 


.9984 


I 

I 


OGRAM  SIMULA 

NERATES  MULTIVARIABLE  UNIVARIATE  SAMPLE 


LE  NAME;  DFILl 
MENSION  (M)  =  10 
gDEFFICIENT  MATRIX  (C) 


1 


'4.* 


1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

.1000 

.1000 

1.0000 

.1000 

.1000 

.1000 

.1000 

.1000 

.1000 

.1000 

.2000 

.2000 

.2000 

.2000 

1.0000 

.2000 

.2000 

.2000 

.2000 

.2000 

.3000 
.  3000 

.  3000 
.  3000 

.3000 

1.0000 

.3000 

.3000 

.3000 

.3000 

.  4000 
.4000 

.4000 

.4000 

.4000 

.4000 

1.0000 

.4000 

.4000 

.4000 

.5000 

.5000 

.5000 

.5000 

.5000 

.5000 

.5000 

1.0000 

.5000 

.5000 

.6000 

.6000 

.6000 

.6000 

.6000 

.6000 

.6000 

.6000 

1.0000 

.6000 

.7000 

.7000 

.7000 

.7000 

.7000 

.7000 

.7000 

.7000 

.7000 

1.0000 

.8000 

1.0000 

.8000 

.8000 

.8000 

.8000 

.8000 

.8000 

.8000 

.8000 

.9000 

.9000 

.9000 

1.0000 

.9000 

.9000 

.9000 

.9000 

.9000 

.9000 

% 

•r, 

k-, 

I 

SIGMA  (SCALING  PARAMETER  FOR  INDEP  VARIABLES)  = 
:0  (INTERCEPT)  =  1.0000 


10.0000 


D 

1 

AR/'i'lETER  VECTOR  (B) 
1.0000  1.0000 

•  •  • 

-1.0000 

1.0000 

1.0000 

-1.0000 

1.0000 

1.0000 

-1.0000  1.0000 

4^1  G  (MODEL  ERROR  STD 

N 

;iO  OF  OBSNS  =  100 
^IP  0  RANDOM  NOS 

'variance  matrix  OF  X 
^  100.0000  10.0000 

DEV)  =  1.0000 

PRIOR  TO  SIMULATION 

,  Y.  .  . 

20.0000  30.0000 

40.0000 

50.0000 

60.0000 

70.0000 

80.0000 

10.0000 

90.0000 

109.0000 

250.0000 

46.0000 

64.0000 

82.0000 

100.0000 

118.0000 

136.0000 

154.0000 

20.0000 

172.0000 

46.0000 

391.0000 

136.0000 

98.0000 

124.0000 

150.0000 

176.0000 

202.0000 

228.0000 

30.0000 

254.0000 

64.0000 

406.0000 

98.0000 

181.0000 

166.0000 

200.0000 

234.0000 

268.0000 

I 

302.0000 

40.0000 

336.0000 

82.0000 

679.0002 

124.0000 

166.0000 

244.0000 

250.0000 

292.0000 

334.0000 

376.0000 

50.0000 

417.9999 

100.0000 

826.0000 

150.0000 

200.0000 

250.0000 

325.0000 

350.0000 

400.0000 

I 

450.0001 

60.0000 

500.0000 

118.0000 

924.9999 

176.0000 

234.0000 

292.0000 

350.0000 

424.0000 

466.0000 

■ 

524.0000 

70.0000 

582.0000 

136.0000 

1126.0001 

202.0000 

268.0000 

334.0000 

400.0000 

466.0000 

540.9999 

I 

f 

1  597.9999 

664.0001 

1279.0002 

80.0000 

154.0000 

228.0000 

302.0000 

376.0000 

450.0001 

524.0000 

597 .9999 

1  676.0000 

745.9999 

1426.0000 

n 

;  90.0000 

172.0000 

254.0000 

336.0000 

417.9999 

500.0000 

582.0000 

664.0001 

745.9999 

829.0000 

1591.0002 

,  250.0000 

391.0000 

406.0000 

679.0002 

826.0000 

924.9999 

1126.0001 

1279.0002 

1 

1.426.0000 

1591.0002 

3386.0010 

COEFFICIENT 

OF  DETERJII NATION  = 

.9997 

i 

5RRELATION 

MATRIX  OF 

X,  Y.  .  . 

1.0000 

.0958 

.1715 

.2230 

.2561 

.2774 

.2914 

.3010 

II 

.3077 

.3126 

.4296 

s 

.0958 

1.0000 

.3778 

.4556 

.5028 

.5313 

.5489 

.  5601 

.5673 

.5722 

.6436 

.1715 

.  3778 

1.0000 

.6246 

.6807 

.7135 

.7329 

.7447 

.7520 

.7565 

.5983 

!Cm 

.2230 

.4556 

.6246 

1.0000 

.7899 

.8246 

.8447 

.8564 

.8634 

.8674 

.8673 

••  ” 

.2561 

.  5028 

.6807 

.7899 

1.0000 

.8878 

.9078 

.9193 

\* 

.9258 

.9294 

.9087 

.2774 

.5313 

.7135 

.8246 

.8878 

1.0000 

.9429 

.9539 

h  *, 

.9601 

.9633 

.8818 

.2914 

.5489 

.7329 

.8447 

.9078 

.9429 

1.0000 

.9730 

.9788 

.9817 

.9397 

.  3010 

.  5601 

.7447 

.8564 

.9193 

.9539 

.9730 

1.0000 

Jv 

.9888 

.9915 

.9450 

.3077 

.5673 

.7520 

.8634 

.9258 

.9601 

.9788 

.9888 

1.0000 

.9965 

.9425 

i 

.3126 

.  5722 

.7565 

.8674 

.9294 

.9633 

.9817 

.9915 

■ 

.9965 

1.0000 

.9496 

.  4296 

.  6436 

.5983 

.8673 

.9087 

.8818 

.9397 

.  9450 

a: 

.9425 

.9496 

1.0000 

r.. 

OBSERVATIOi-;S.  .  . 
^PENDENT  VARIABLE  LAST 


5^IRST  3  OBSNS  PRINTED) 
'OBSN  NO  1 : 


■^■3SN  NO 

4.2114 
?•  44.4399 

t^SN  NO 
-10.1245 
j.-  21.3852 


I 

I 

ft 


-20.8147 

-64.8050 

0  . 

-25.4502 

-113.0040 

-24.6717 

-20.9165 

-46.6223 

-46.2266 

-49.1668 

11.6345 

50.4255 

9.0342 

105.6835 

27.9729 

18.0132 

31.7175 

37 .6484 

39 .3604 

2  - 

-4.9788 

23.0935 

-1.7315 

41.3710 

14.1799 

13.3178 

7.2448 

15.5234 

17 .7160 

♦ 

I 

I 


...  .f .  v.i- 


OGRAM  MR EG 

ILTIPLE  LINEAR  REGRESSION  ANALYSIS 


iTA  FILE  NAME:  DFILl 


OF  OBS,  NO 


i  REGRESSION, 
100 

VOICES  OF  INDEP 
1 


OF  VARS,  INDEX  OF 
NO  OF  INDEP  VARS 
11 

VARS 


IN 


DEP  VAR 
IN  REGRESSION. 
11  10 
REGRESSION. . . 


2 

10 


DATA. . . 
iFIRST 
« -18.3276 
^  -60.8804 
4.2114 
>  44.4399 


OBSNS ) 
-20.8147 


-10.1245 

21.3852 


-64, 

11, 

50, 

“4, 


8050 

6345 

4255 

9788 


23.0935 


-25.4502 

-113.0040 

9.0342 

105.6835 

-1.7315 

41.3710 


-24.6717  -20.9165  -46.6223 

27.9729  18.0132  31.7175 

14.1799  13.3178  7.2448 


-46.2266 

37.6484 

15.5234 


lipANS  FOR  ALL 
.1809 


-1.5599 


VARIABLES. . . 
-.2009  -1.0837 

-1.6493  -1.1327 


S'tANDARD  DEVIATIONS  FOR  ALL  VARIABLES... 


-49.1668 

39-3684 

17.7150 


-.5849  -1.0220  -1.0626  -1.0941  -1.5141 


,  11.2156 

11.2752 

12.9436 

16.0995 

17.7264 

19.7445 

23.7825 

26.1492 

||  29.528  2 

32.9943 

67.48  30 

CORRELATION 

MATRIX. .  . 

1.0000 

:!•  .1514 

1.0000 

.1783 

.5113 

1.0000 

M  .1608 

.6446 

.7567 

1.0000 

"  .2139 

.6434 

.8100 

.8599 

1.0000 

.2814 

.6573 

.7846 

.8749 

.9099 

1.0000 

.  .  2753 

.6649 

.8248 

.8957 

.9292 

.9590 

1.0000 

*  .27  57 

.6777 

.8313 

.9052 

.9497 

.9627 

.9813 

1.0000 

.2354 

.6933 

.8404 

.9095 

.9467 

.9672 

.9866 

.9921 

1.0000 

T  .2853 

.6878 

.8412 

.9138 

.9523 

.9684 

.  9866 

.  9945 

.9977 

1.0000 

.  3872 

.7512 

.7412 

.9097 

.9324 

.9197 

.  9560 

.9659 

.9638 

.9680 

1.0000 

TOTERMINANT 

=  .1048E- 

08 

^VERSE  OP 

THE  PART  OP 

THE  CORR 

MATRIX  US 

ED  IN  REGRESSION... 

M  1.2492 

.1726 

2.0793 

fir  .3164 

.  5738 

3.7911 

B  .7514 

-.0601 

.  3480 

6.4310 

.7383 

.  2471 

-.1940 

.8414 

11.6186 

to  .1613 

.4591 

1.6436 

.8575 

2.0441 

17 . 2304 

p  .6074 

1.2267 

.5277 

1.1366 

5.0786 

-.8493 

42 . 3096 

^  -2.0589 

-3.7660 

-4.9050 

-5.9452 

-13.3671 

-15.6544 

-40 . 467  3 

-21.7312 

-2.0589 

-3.7660 

-4.9050 

-5.9452 

-13.3671 

-15.6544 

-40.4673 

28 . 4020 

1  90.7163 

2.0589 

-90.7163 


3.7660 

55.6381 


4.9050 


5.9452  13.3671  15.6544 


40.4673  -38.8296 


g^ODUCT  OF  MATRIX  AND  INVERSE. . . 

SHOULD  BE  IDENTITY  MATRIX.  IF  NOT,  ROU.^DOFF  ERRORS 


f' 


'E  RUINED 

SOLUTION. . . 

DISREGARD 

REST  OF 

ANALYSIS. 

1.2556 

.5865 

1.0617 

.7119 

.0645 

1.9018 

.7687 

.0606 

.9678 

4.0301 

.7869 

.0347 

.9728 

3.1224 

7.2449 

.8072 

.0493 

1.0068 

3.1927 

6.3799 

5.4214 

.8210 

.0476 

1.0224 

3.2494 

6.4941 

4.4960 

8.3334 

.8035 

-.0005 

.9697 

3.2154 

6.4452 

4.3768 

6.9886 

-85.4023 

.8328 

49.9736 

.0584 

1.0445 

3.2812 

6.5137 

4.5306 

7.3777 

-85.3785 

.8442 

.  0575 

1.0576 

3.3284 

6.6131 

4.5963 

7.4849 

-85.6612 

49.7392 

-63.9288 

iVEKSE  OF  THE 
.1003E-03 
.1379E-04 
. 2202E-04 
.4204E-04  ■ 

.3751E-04 
.7379E-05 
.2300E-04 
-.70S1E-04  ■ 

-.6280E-04  ■ 

.1051E-02 
.5620E-04 
-.9405E-03 


PART  OF  THE  CROSS  PROD  MATRIX  USED  IN  REGRESSION. 


.1652E-03 
.3971E-04 
.3344E-05 
.1249E-04 
. 2083E-04 
. 4621E-04 
.1290E-03 
.1143E-03 

.1023E-03 

.5162E-03 


.2286E-03 

,1687E-04 

,8540E-05 

,6496E-04 

,1732E-04 

,1464E-03 

,1296E-03 


.2506E-03 

.2978E-04 

.2725E-04 

.2999E-04 

-.1426E-03 

-.1263E-03 


1160E-03  .1131E-03 


.3735E-03 
. 5899E-04 
.1217E-03 
•.2913E-03 
•.2580E-03 

.2309E-03 


.4464E-03 

-.1827E-04 

-.3063E-03 

-.2712E-03 

.2427E-03 


.7556E- 
.6573E- 
■.  5821E- 

.5209E- 


1 


9^ 


DEP  VAR 


REGRESSION  = 


'i'NDICES  OF 

_  1 

INDEP  VARS  IN  REGRESSION... 

2  3  4 

5 

6 

7 

8 

?  5 

xT-EGRESSION 

.9687 

10 

COEFFICIENTS.  .  . 

.8740  -1.0836  .9737 

.9768 

-1.1008 

.6742 

-169.3762 

£•:  -39.3268 
j.2*- VALUES.  .  . 
.1583 

14.5951 

.1113  -.1173  .1007 

.0827 

-.0853 

.0401 

-15.4745 

igi  -1.9858 
‘STd  DEVS  OF 
6.1186 

1.0515 

REG  COEFS. .  . 

7.8522  9.2360  9.6712 

11.8062 

12.9079 

16.7925 

10.9455 

^  19.8043  13.8804 

INTERCEPT  =  -294.8914 

feSIDUAL  STANDARD  ERROR  =  610.9026 

SAMPLE  MULTIPLE  CORRELATION  COEFFICIENT  = 

8.5249 

Sample  coefficient  of  determination  =  -72.6732 

K’UM  OF  SQUARES  ATTRIBUTABLE  TO  REGRESSION,  SSAR  :^-32764136, 

DEGREES  OP  FREEDOM  ASSOCIATED  V?ITH  SSAR  =  10.0000 

IEAN  SQUARE  OF  SSAR  =  -3276413.5000 
UM  OF  SQUARES  OF  DEVIATIONS  FROM  REGRESSION,  SSDR 
EGREES  OF  FREEDOM  ASSOCIATED  WITH  SSDR  =  89.0000 

MEAN  SQUARE  OF  SSDR  =  373202.0000 

K-VALUE  =  -8.7792 


0000 


33214978.0000 


‘W' 


I^OGRAM  ITEREG 

C3TIMATES  REGRESSION  COEFFICIENTS  BY  AN  ITERATIVE  METHOD 


I 


ATA  FILE  NAME;  DFILl 


OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
■N  REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION... 

I  100  11  11  10 

^DICES  OF  INDEP  VARS  IN  REGRESSION... 

1  2  3  4  5 

9  10 


REGRESSION  METHOD  USED  AT  EACH  ITERATION;  WALD’S  METHOD 


6  7 


8 


I^INT  AFTER 

EVERY  40 

ITERATIONS 

DATA. . . 

v'FIRST  3 

OBSNS ) 

-18.3276 

-20.8147 

-25.4502  - 

■24.6717 

-20.9165 

-46.6223 

-46.2266 

-49.1668 

-60.8804 

-64.8050  - 

113.0040 

4.2114 

11.6345 

9.0342 

27.9729 

18.0132 

31-7175 

37.6484 

39.3684 

^  44.4399 

50.4255 

105.6835 

-10.1245 

-4.9788 

-1.7315 

14.1799 

13.3178 

7.2448 

15.5234 

17.7160 

..  21.3852 

23.0935 

41.3710 

iVEANS  FOR  VARIABLES  IN 

REGRESSIO'': 

’  / 

DEPENDENT  VARIABLE  LAST... 

H  .1809 

-.2009 

-1.0837 

-.5849 

-1.0220 

-1.0626 

-1.0941 

-1.5141 

■  -1.5599 

-1.6493 

-1.1327 

‘^^SDIANS. 


.1880 

.0173 

-1.1221 

-1.7224 

.4272 

-1.7786 

-1.4766 

-3.6473 

-2.5523 

-2.8128 

-.5040 

RnITERQUARTILE  RANGES.. 

*  • 

15.4688 

15.2031 

18.2188 

24.4649 

27.7266 

30.3711 

36.3750 

39.9383 

49.3359 

51.7383 

93.7500 

•-EGRESSION 

COEFFICIENTS,  INTERCEPT. .  . 

1.0906 

1.0271 

-.9463 

.8936 

1.8787 

1.0710 

.2496 

.0809 

.1764 

-.4496 

1.3606 

s* 

1.0074 

1.0439 

-.8263 

1.0221 

1.0995 

.  2052 

1.1456 

.1096 

.  3533 

-.4498 

1.1670 

1.0093 

1.0351 

-.8215 

1.1452 

.9199 

-.5670 

1.5165 

.  3191 

.2901 

-.3364 

1.0498 

Jk 

1.0185 

1.0238 

-.8448 

1.1827 

.9041 

-.8994 

1.5305 

.5827 

.1307 

-.2183 

1.0337 

1.0240 

1.0140 

-.8707 

1.1778 

.9241 

-.9924 

1.4104 

.8170 

,v 

-.0271 

-.1211 

1.0600 

EAR  I  NT  AFTER 

EVERY  40 

ITERATIONS 

■ 

1.0265 

1.0061 

-.8934 

1.1602 

.9481 

-.9931 

1.2661 

.9991 

-.1501 

-.0466 

1.0947 

1.0275 

.9998 

-.9118 

1.1422 

.9695 

-.9673 

1.1377 

1.1284 

i 

-.2356 

.0087 

1.1252 

1.0278 

.9949 

-.9263 

1.1272 

.9871 

-.9388 

1.0349 

1.2158 

-.2907 

.0490 

1.1484 

1 

1.0278 

.9912 

-.9374 

1.1160 

1.0012 

-.9145 

.9568 

1. 2720 

1.1160 


I.U012 


9145 


9568 


1. 2720 


w.v».ir«i.»TL-^  WJ.'VtW IW.l/^UIIWflmwrTiP^-ipw.T.'^ri wri  wui'gv^'i «  i  .  .l.■Lll■.»....■.,.,-^ 


e 

iJt^OGRAM  SIMULA 

[;*NERATES  MULTIVARIABLE  UNIVARIATE  SAMPLE 


LE  NAME:  DFILl 


TMENSION  (M)  =  10 


^EFFICI  ENT 

M  1.0000 

MATRIX  (C) 
0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 
■  0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

vi  0.0000 
0.0000 

0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 
h-:  0.0000 

0.0000 

0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

1.0000 

0.0000 

^  0.0000 
0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

1.0000 

-v  0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

1.0000 

g  1.0000  0.0000 

SIGMA  (SCALING  PARAMETER  FOR  INDEP  VARIABLES)  = 

(INTERCEPT)  =  1.0000 

10.0000 

^\ra:'1eter  vector  (e)... 

!■  1.0000  1.0000  -1.0000  1.0000  1.0000  -1.0000  1.0000  1.0000 
'•  -1.0000  1.0000 


;>IG  (MODEL  ERROR  STD  DEV)  =  1.0000 


NO  OF  OBSNS  =  100 

KSKIP  0  RANDOM  NOo  PRIOR  TO  SIMULATION 


VARIANCE  MATRIX  0?  X,  Y... 


I 


I 


100.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

100.0000 

0.0000 

100.0000 

0 . 0000 

0.0000 

0.0000 

0.0000 

100.0000 

0.0000 

0.000  0 

100. 00  no 

o.nooo 

0.0000 

0.0000 

-100.0000 

0.0000 

0.0000 

0.0000 

100.0000 

0.0000 

0.000  0 

100.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

100. 0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

o.ooco 

-100.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

100.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

100.0000 

0.0000 

0.0000 

0.0000 

0.0000 

100.0000 

0.0000 

0.0000 

0.0000 

0.0000 

100.0000 

0.0000 

0.0000 

0.0000 

0.0000 

100.0000 

1  0.0000 

100.0000 

200.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

H  100.0000 

100.0000 

0.0000 

1  0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

100.0000 

100.0000 

200.0000 

200.0000 

M  100.0000 

100.0000 

-100.0000 

100.0000 

100.0000 

-100.0000 

100.0000 

200.0000 

I  (^-0000 

200.0000 

1101.0000 

COEFFICIENT  OF  DETERMINATION  = 

.9991 

itoRRELATION  MATRIX  OF 

X,  Y.  .  . 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

■  0.0000 

0.0000 

.3014 

B  0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

.3014 

0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

-.3014 

0.0000 

0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

.3014 

K?  0.0000 

0.0000 

0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

.3014 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

0.0000 

-.3014 

^  0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

1.0000 

0.0000 

0.0000 

0.0000 

.3014 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

1.0000 

0.0000 

.7071 

.6027 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0. ocoo 

1.0000 

.7071 

0.0000 

^  0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

0.0000 

.7071 

■  .7071 

1.0000 

.4262 

.  3014 

.3014 

-.3014 

.3014 

.3014 

-. 3014 

.  3014 

.6027 

0.0000 

.4262 

1.0000 

OBSERVATIOi 

N'S .  .  . 

DEPENDENT  VARIABLE  LAST 

{ifrlRST  3 

OBSNS  PRINTED) 

b'BSN  ^30 

1: 

-18.3276 

-15.0855 

-13.7183 

-4.2261 

13.3911 

-20.8667 

-6.9995 

4 .9927 

■;-!  -14.8901 

-9.8975 

12.2946 

lObsn  no 

2: 

4.2114 

6.8285 

-2.4292 

16.4380 

-6.5698 

8.5474 

11.7895 

3.1557 

2.6489 

5.8057 

34.4939 

:>-BSN  NO 

3: 

'  -10.1245 

-8.1323 

-8.0151 

10 . 2270 

6.5942 

-8.9136 

3.7036 

4.4458 

13.3130 

17.7588 

27.6306 

i 

fcS: 
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OGRAM  MREG 

LTIPLE  LINEAR  REGRESSION  ANALYSIS 


I 


^TA  FILE 

NAME:  DFILl 

R)  OF  OBS, 

NO  OF  VARS 

,  INDEX  OF 

DEP  VAR 

IN  REGRESSION,  NO  OF 

INDEP  VARS 

IN  REGRESSION... 

i  100 

11 

11 

10 

Bodices  o? 

INDEP  VARS 

IN  REGRESSION... 

1 

2 

3 

4 

5 

!  ^ 

10 

DATA. . . 

1  ^IRST 

3  OBSNS) 

'  a  -18.3276 

-15.0855 

-13.7183 

-4.2261 

13.3911 

1  ^  -14.8901 

-9.8975 

12.2946 

4.2114 

6.8286 

-2.4292 

16.4380 

-6.5698 

’  ^  2.6489 

5.8057 

34.4939 

-10.1245 

-8.1323 

-8.0151 

10.2270 

6.5942 

13.3130 

17.7588 

27.6306 

j  ^ANS  FOR  , 

ALL  VARIABLES... 

.1809 

-.0144 

-.8847 

-.0300 

-.4503 

-.2816 

-.9428 

.6235 

^ilANDARD  DEVIATIONS  FOR  ALL  VARIABLES... 


11.2156 

10.1579 

i 

8.7924 

11.7031 

CORRELATION 

MATRIX . . 

1.0000 

.0715 

1.0000 

.0254 

.0648 

-.0757 

.2606 

P 

-.0717 

.1217 

t-. 

.0586 

.0191 

.0552 

.0508 

-.0626 

-.0388 

-.0012 

1.0000 

.1444 

■f 

-.0489 

.0787 

*  : 

.6495 

1.0000 

.2755 

.  4226 

-.0381 

.4271 

#ETERMINANT  =  -.2833E-06 


9,8358 

32.6040 

10.7963 

9.8302 

1.0000 

,1868 

1.0000 

.2235 

.1323 

1.0000 

-.1219 

-.0018 

-.0893 

.1240 

.1180 

-.0356 

-.0055 

.0173 

.1513 

.0992 

.0022 

-.1068 

.0704 

.0150 

.0358 

-.0722 

1.0000 

.4118 

.  3835 

6 

7 

8 

-20.8667 

-6.9995 

4.9927 

8.5474 

11.7895 

3.1567 

-8.9136 

3.7036 

4.4458 

-.2456 

.0841 

-.6612 

9.7790 

10.1817 

8.9774 

1.0000 

.1012 

1.0000 

-.0569 

.0051 

1.0000 

-.0386 

.1414 

-.1326 

-.0726 

.1102 

.6675 

-.2632 

.  3084 

.5941 

DETPIEMINANT,  Fx’EGRESSION  CANNOT  BE  PERFORT'lED 


I 


I 

t 

I 

I 

I 


OGRAM  I TER EG 

TIMATES  REGRESSION  COEFFICIENTS  BY  AN  ITERATIVE  METHOD 

TA  FILE  NAME:  DFILl 

OF  OBS,  NO  OF  VARS,  INDEX  OF  DEP  VAR 
REGRESSION,  NO  OF  INDEP  VARS  IN  REGRESSION... 

100  11  11  10 

DICES  OF  INDEP  VARS  IN  REGRESSION... 

1  2  3  4  5 

9  10 


REGRESSION  METHOD  USED  AT  EACH  ITERATION:  WALD'S  METHOD 
INT  AFTER  EVERY  2  ITERATIONS 


MTA.  .  . 

^IRST  3  OBSNS) 
£’•-18.3276  -15.0855 

-14.8901  -9.8975 

I?  4.2114  6.8286 

£  2.6489  5.8057 

-10.1245  -8.1323 

M  13.3130  17.7588 


-13.7183 

12.2946 

-2.4292 

34.4939 

-8.0151 

27.6306 


-4.2261 

16.4380 

10.2270 


13.3911 

-6.5698 

6.5942 


-20.8667 

8.5474 

-8.9136 


-6.9995  4.9927 

11.7895  3.1567 

3.7036  4.4458 


CANS  FOR  VARIABLES  IN  REGRESSION, 
i^PENDENT  VARIABLE  LAST... 


li  .1809 

-.0144 

-.8847 

-.0300 

-.4503 

-.2456 

.0841 

-.6612 

■  -.2816 

-.9428 

.6235 

i^DIANS. .  . 

g  .1880 

.8130 

-.7495 

.5786 

-.8276 

-.2026 

-1.4136 

-1.2183 

-.0464 

-1.6943 

1.4197 

RtERQUARTILE  RANGES... 

15.4688 

14.0235 

12.7734 

16.7969 

13.5547 

10.8985 

15.6250 

10.9766 

,,  12.5390 

15.0781 

36.7578 

^Degression 

COEFFICIENTS 

,  INTERCEPT... 

.9995 

.9142 

-1.0840 

.9995 

.9548 

-1.0296 

1.0589 

2.0564 

m  .0479 

-.0360 

.9540 

^  .9978 

.9977 

-.9735 

.9893 

.9673 

-.9902 

1.0010 

2.0465 

.00  39 

-.0265 

1.0633 

w  .9975 

.9951 

-.9748 

.9900 

.9668 

-.9901 

1.0019 

2.0334 

m  -.0085 

-.0135 

1.0620 

“  .9975 

.9951 

-.9749 

.9900 

.9668 

-.9901 

1.0019 

2.0205 

*  --0214 

-.0006 

1.0619 

m  -9975 

.9951 

-.9749 

.9900 

.9668 

-.9901 

1.0019 

2.0076 

B  -.0343 

.0124 

1.0619 

■CSIDUAL  VARIANCE  =  1.2721 

JcSIDUAL  STANDARD  ERROR  =  1.1279 

COEFFICIENT  OF  DETER:*1INATI0N  =  .9988 


